FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCT GAAGACTAACATT TTGTGAAGTTGTAAAACAGAAAACCT GTTAGA AATG TGGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAAT GCT TATT TGGGGCAATGCTAAATATTGCGGCAGTTTTATGCAT TG 
CTACCATT TAT GT TCGTT ATAAGCAAG TTCATGCTCTGAGT C C TGAAGAGAACGT TAT CAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
M TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
|y ATCAGTTTTGCACAGTGGCAAT TTTGGGACTGAT TTAGAACAGAAACTCCAT TGGAACCCCGAGG 

U 1 - ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 

7i GGT T T T T T CCT GACT TACAT TCGT GAT T T T CAGAAAAT T T CT T TACGGG T GGAAGCCAAT T TACA 

111 

1 ' T GGAT TAACCC TC TATGACACT GCACC T TGC CC TATTAACAATGAACGAACACGGC TACT T TCCA 

p GAGATAT T TGATGAAAGGATAAAATAT T TCT GTAATGAT TAT GAT TCT CAGGGAT T GGGGAAAGG 

l?1 TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT GAATGCTGATAAT CAGGAAACATGAAAGAAGCCAT TT GATAGAT TAT T CTAAAGGAT ATCAT 



CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCI AT I YVR YKQVHAL S P E ENV 1 1 KLNKAG LVL GILSCLGLS I VANFQKT T L FAAHVS GAVL T FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features : 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains : 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
|d? CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
O CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
T GAC T CT CAT GG CTCTTCTTTCT GGAT TTGGTGCT GT CAAC T GC CCAT ACAC T TACATGT CT T AC T T C 
? ? ? CTCAGGAAT GT GACT GACACGGATATT CTAGCCCTGGAACGGCGACT GCTGCAAACCAT GGATAT GAT 

ill 

CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
m AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
HI CT TAT TCAACAGGAAGT GGAT GCT TTGGAAGAATTAAGCAGGCAGCT TTTT CT GGAAACAGCT GATCT 

■ ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 
W ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
JfJ GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
2 GAAGTTTT GGTC CCAACACAT TT CCTTCATT CTT GT TGGAATAATCATCGTCACAT CCAT CAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
f|| CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCACCT TGAACT TAAGCCTACTACAGACTGT TAGAGGCCAGTGGT TT CAAAATTT A 
GAT AT AAG AG G G G G G AAAAAT GG AAC C AG G GC C T GAC AT T T T AT AAAC AAAC AAAAT GC TAT G G TAG C 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE A 

MS FLIDSS IMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVI FSVTFAFSCTMFELI I FEILGV 
LNSSSRYFHWKMNLCVILLILVFIWPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTIJYLALLSGFGAW^ 

TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDWFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGELQFNFYHRW FDVI FLVSALS S I L FLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 
amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE 5 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATGG CCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
G TCATGCAGAAT TATGGGGATCACCC T T GT GAGCAAAAA.GGCGAACCAGCAGC TGAATT T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TAT CACCACCAAAGAT CCCATAT TCAACAC TCAAAC T GCAACACAAACAACAGAATT TAT T GT 
M CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
p CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
3f AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
f[ CAGCTGGTCTTGGATTTTGCTATGTCAATVAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
;UJ CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
ft! TGAGGAATCAAAGAA^CTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC 
fp GATGCCT GGAAGC TGAAGT T TAGATGAGACAGAAATGAGGAGACACACCT GAGGCTGGT T T C T T T 

CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
^ 8 CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
* GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
P TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
||i CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
m AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
Ifi GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
p GCTAAAATAAAGAAATAGAACAAGGCTGAGGATAC GACAGTACACT GT CAGCAGGGACT GTAAAC 

111 ACAGACAGGGTCAAAGTGTT TTCTCTGAACACAT TGAGT TGGAATCACTGTTTAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC T C T TAT AAAT T T C TAT T T T TAT C T GAG T T AC AG AAAT GAT T AC T AAGG AAGAT T 
AC T CAGT AAT TTGT TTAAAAAGTAATAAAAT T CAACAAACAT T TGC TGAATAGCTAC TAT ATGTC 
AAGTGCTGTGCAAGGTATTACACTC TGTAATTGAATAT TATT CCTCAAAAAAT TGCACATAGTAG 
AACGC TATC T GGGAAGC T AT T T T T T TC AGT T T TGAT AT T T CT AGC T TAT CT AC TT C C AAAC TAAT 
T T T TAT T T T T G C T GAG AC T AAT C T TAT T CAT T T T C T C T AAT AT G GC AAC CAT T AT AAC C T T AAT T 
TAT TAT TAACATACC TAAGAAG TACAT T GTTACC T C T ATATACCAAAGCACAT T T TAAAA3TGCC 
AT TAACAAAT GTAT CACTAGCCC TCCT T T TT CCAACAAGAAGGGAC TGAGAGATGCAGAAAT AT T 
TGTGACAAAAAATTAAAGCAT TTAGAAAACTT 
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FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L SLAGKDQVETALKAS FETCS YGWVGDGFWI S RI S PNPKCGKNGVGVL I WKVPVS RQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVmETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLV^LLFFGAAAGLGFCYW 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 



Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 



HI Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGC GCAGCGCGGT GGAAGAGAT GGAGGCAGAAGAAGCT GC TGCTA 
AAGCAT CAT CAGAAGT GAACC TGGCAAACT TACC T C CCAGC TAT CACAAT GAGAC CAACACAGAC 
ACGAAGGT T GGAAATAATAC CATC CAT GT GCACCGAGAAAT T CACAAGAT AACCAACAACCAGAC 
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
M ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
[3- ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
p| GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
f n I ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
0; CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
PI GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
||] CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
s CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
PI TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
!H CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
J* GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
W TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
O TGAT TGGT TT TGGGGAAAT G TGGAGAAGAGT GCCC TGCT TT GCAAACAT CAACCT GGCAAAAAT G 

f|J CAACAAATGAAT T T TCCACGCAGT TC T T TCCATGGGCAT AGGTAAGC T G TGCC T T CAGCT GT TGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACC T C TGT GCCAGGGCAGCAT T T TCATAT CCAAGAT CAAT T CC C TCTCTCAGCACAGCCT GGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
C AAG T C AC AC AG C TAG T G AAG AC C AGAG C AG T T T CAT CTGGTTGT GAC T C T AAGC T C AG T G C T C T 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT TGACAC TGTCCCTCTTT GGCAGT T GCAT TAGTAACT T T GAAAGGTAT AT GACT GAGCGTAGCA 
TACAGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGC 
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
T GT GTGAAACATGGTTGTAATAT GCGACT GC GAACACT GAAC TC TACGCCACT CCACAAATGAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T G TAT AAGCAT GCTTTCTT TGAG T T T TAAAT T AT GT AT AAAC ATAAGT TGCAT T TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETN^ 

TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation si te . 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATARATACATCATGCiVACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT T GACCACAAGC T CAGAGGAGT GCAGAACC CTGT AGCCCGC TGCATCATGTGCTGTT T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCT CAACTATTACT GGCTGCCCATCATGACCT CCAT CCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACC TGGAGCGGAACAACGGC TC CCT GGACCGGCCCT ACTACAT GTCCAAGAGCC T TC TAA 
AGAT T C T G G GCAAGAAGAACG AG GC G C C CC C GGACAACAAGAAGAGGAAGAAG TGACAGC T CC GG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACC TCCGT CT C TAT TAAAAATACAAAAAT TAGCCGAGAGT GGTGGC ATGCACC T GTCAT C C CA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT CGCGCCAC TGCAC T CCAACCTGGG TGACAGACTCT GT C T CCAAAACAAAACAAACAAACAAA 
AAGATT T TAT TAAAGATAT T T TGT TAACTC 



FIGURE IP 



RTRGRTRGGCEKVPINT SCNPTAHLVNSSCPGLMCVFQGYSSKGL I QRSVFNLQ I YGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PT FPL I SAFI RTLRYHTGSLAFGAL I L TLVQI ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
I VRVWLDKVTDLLLFFGKLLWGGVGVLS FFFFSGRI PGLGKDFKS PHLNYYWLP IMTS I LGAY 
VI AS G FFS VFGMCVDT LFLC FLEDLERNNGS LDRPYYMS KS LLK I LGKKNE APPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins . 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 



APP ID-1 0063685 



Page 164 



FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATCGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
T TT GTCAC CTGGTCAGCCCTATCCAGTAT CCCTGAACAGAAAT GCAACCCC CAT TTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GAG TTCGTGCCT T AC T GAGTC T C T AAGAC T T T T T CT AAT AAAC AAGC C AGT G C GT GT AAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPEMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVIASLHVMMTLTNWYKPGETRKMI STWTAVWVKI CASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 



|l| amino acids 1-20 

ffi 

;Z Transmembrane domains: 

yi 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
P 272-283, 324-340, 391-406, 428-444 

I '0, 

?"\ 
M 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGG ATGA ACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGAT TGAGT CCTATGAAGGAAGGGAAAAGAAAGGCATAT CT GATGTCAGGAGGAC 
TTTCTGTTTGTTTGT CACCT T T GAC C T CT TATTCGTAACAT TAC T GTGGATAATAGAGT TAAATG 
TGAATGGAGGCATTGAGAACACAT TAGAGAAGGAGGT GATGCAGTATGACTACTAT TCT TCATAT 
T TTGATATAT TT C T TC T GGCAGT T TT T CGAT TTAAAGT GT TAATAC T T GCAT ATGC TGT GT GCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCAC TT TTAGAAC T ATGA GTACT ACTT T T GT TAAAT GT GAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAG TGGAGT C TCCC TGT CGACAG TAAAGT T GAAATGGT GACGTC 
CAC TGC TGGC TT TAT T GAACAGC T AATAAAGAT T TAT T TATT GTAATACCT CACAAACGT T G TAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T T AAT AT AT T G AAG T AAC AC x T T T T TAG T AAG C AAGAT AC C T T T T TAT T T C AAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C TTGCAAAAGGGGAAGAAAGGAAT T GC GAAT ACATGT AAAAT GTCACCAGACATTT GTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAAT GAC T T AAAC CAT T CA T A T CAT GTTTCCTTTGCGTT C AG C C AAT T T CAAT T AAAAT G AA 
C TAAAT T AAAAA 



FIGURE izl 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARISSYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVT S AFLLAKVT LSKL FSQGAFGYVL PI IS FI LAW I ET W FLDFKVLPQEAEEENRLL I VQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein : 
Signal peptide: 

amino acids 1-20 

|fl Transmembrane domains: 

W amino acids 54-72,-100-118, 130-144, 146-166 

sp- N-myristoylation sites. 

5 * amino acids 14-20, 78-84, 79-85, 202-208, 217-223 

p 

If! 

ftv 
I II 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCGCTACATC TAAG CCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCAC GTAAGT T C TCAGT C CCAACCT CGGGGATGT GT GCAGACAGGGC TGT GTGACCACAGC T 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 
G GGAAG AT G T T T T T C AAAC T CAGAGAC AAG G AC TTTGGTTTTTG T AAG AC AAAC GAT GAT AT G AA 
GGCCT T T TGT AAGAAAAAAT AAAAGAT GAAGT GT GAAA 



FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY ' 

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTAISrVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 

TRCHFP PKNAGRLLLE L DYADFGC PAT TTT AT VP T TRP WRE PTALS S S LAPT WL S P TAPAT EAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAAIAAyGAAYCVRRGRAMAAAAQDKGQVG PGAG PLE LEGVKVP LE PGPKATEGGGEALPS G SE 

CEVPLMGFPGPGLQSPLHAKPYI 

Important features : 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N~glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGAT GAATCCT TAGATTCCAAGACTACTT TGACAT CAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
AT TAGAATCCTCTAT TCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGAT AT C AG C T T T C T AG AGT C T C C AAAT C C AG AAAAC AAG G AC TAT G AAG AGC C AAAG AAAGT A 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T TCC TAGATAAGGAGTAT GATGAATGTACATCAGAT GGGAGGGAAGAT GGCAGACTGTGGT GTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGC AGAAAT G ATG TAT C AAAC T GGAAT G AAAATC C T TAAT GGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGATGTT TGAGAAGCTGACT GAGGAAGGCTCTCCCAAGGGACAGAC TGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCT TGGGGGCAAT C TAATAGC CCACAT GGT T TT GGTAAGT AGAC T T TAG TGGAAGGC T 
AATAATAT T AAC AT C AG AAGAAT TTGTGGTT T AT AGC GG C C AC AAC T T T T T CAGC TT TCATGATC 
C AG AT T T G C T T G TAT T AAG AC C AAAT AT T C AG T T GAAC T T C C T T C AAAT T C T T G T TAAT G GAT AT 
AACACAT GGAAT CTACAT GTAAATGAAAGT T GGTGGAGT CCACAATTTT TCT T TAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATA&ATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGTCAGAATCAT T TT T TACATTAGAT TATCATAAT TT TAAAAAT T T T T C T T TAGT T T TT CA 
AAAT T T TGTAAAT GGT GGC TATAGAAAAACAACAT GAAATAT TATACAATAT T T TGCAACAAT GC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACT CAT T TT TAATAAAAT TAT GTCTAAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 18 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALV Y YT FGALGGNL IAHMVXVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATT CAGAT TT TAAGCCCAT TC TGCAGT GGAAT T TCAT GAAC TAGCAAGAGGACAC CAT CT T C TT 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATT T T TAT CAC T GGAT GT GACTCGGGC T T T GGAAAC T TGGC AGCCAGAAC T T TT GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGC TCCCAC TGAC TGGCT GACAC TAGAGGACTACAGAGAACC TAT TGAAGT GAA 
CCTGTTTG GAC T CAT C AG T G T GAC AC T AAAT AT GC TTCCTTTGGT C AAGAAAGC T C AAGG GAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
AT GCAT TGAACCAGGAT TGTTCAAAACAAACTTGGCAGATCCAGT AAAGGTAAT TGAAAAAAAAC 
TC GCCAT TT GGGAGCAGCT GTCTCCAGACAT CAAACAACAAT AT GGAGAAGGT TACAT T GAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AAT TGGCC GATT T CAAGAACACATC TCC T T T TCAACCC CAT T CC T TAT C TGCTCCAACCT GGACT 



ass CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

HI 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CATC TC T T ATC TAAAT AT TAAAAGAT AAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAART FDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPE^T^RTAQWVKNQVGEKGLWGLINNAGVPGVIAPTDWLT 
REP I EVNL FGL I S VT LNML P L VKKAQGRVI NVS S VGGRLAI VGGGYT PSKYAVEGFNDS LRRDMK 
AFGVHVSCIEPGLFKTNLADPVK^ 

L S P WECMDHAL T S L FPKTH YAAGKDAKI FW I PL S HMPAAL QDFL LLKQKAE LAN PKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TAC T GAT TCCCAAATGGATGAT GT TGAAGT T GTT TATACAAT T GA 
CAT TCAGAAATATAT TCCAT GCTATCAGCT TT TTAGCT TT TATAAT TCT T CAGGCGAAGT AAATG 
AGC AAGCAC T GAAGAAAAT AT TAT C AAAT G T C AAAAAG AAT G TGG TAG G T T G G T ACAAAT TC C G T 
CGT CAT TC AGAT CAGAT CAT GACGT T T AG AGAGAGGC T GC T T CACAAAAAC T T GCAGGAGC AT T T 
T TCAAAC C AAGACCT T G T T T T TC T GCT AT T AAC AC CAAGTAT AAT AACAGAAAGC TGCT CTAC T C 
r* is ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
jjj GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
m T T T TAGCCGAGC AG TACAAACAC ACAGCT CTAAAT T T T T T GAAGAAGAT GGAT CCT T AAAGGAGG 

hi TACATAAGATAAATGAAAT GTATGCT T CATTACAAGAGGAAT TAAAGAG TAT ATGCAAAAAAGTG 

ffl GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
f^jj GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
pa ATGTCT TTAAAAAATAGACATGT T TC TAAAAGTAGC T G T AAC TACAACCAC CATC T CGAT GTAGT 

Iff AGACAATCT GACC TTAATGGTAGAACAC AC T GACAT TCC T GAAGC TAGT CCAGCTAGTACACCAC 

6! AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 
|'y GATACAC AAGACAAAC GAT C T AAAGC AAAT AC TGG TAG TAG T AAC CAAGAT AAAGCAT C CAAAAT 

»{ GAGCAGCCCAGAAACAGAT GAAGAAAT T GAAAAGATGAAGGGT TT TGGT GAATAT TCAC GGT C TC 

III 

CTACATTTTGATCCTTT TAACCT TACAAGGAGAT T TT TTTATTTGGCTGATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGTAATACACAGATAACTCT TAGTGCAT TTACTTCACAAAGTACT TTTTCAAACAT CA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTAT TAAAAAATAAAATGGAAAAGCAAGAATAGCCT TAT T TT CAAAAT AT GG AAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CAT TAT GGC TAGAGTT GCCAGATAAAATGC T GGAT AT CATGCAATAAATT TGCAAAACAT CAT CT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGES TSAVLSGFVLGALAFQHLNTDSDTEG FLLGEVKGEAKNS I TDSQMDDVEWYT I DI QKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKGFGE YSRS PT F 

Important features: 
Signal peptide: 
amino acids 1-19 

N-glycosylation sites . 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 2ft 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCGACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGGTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGGACGGTTCGGCTTGCCCGCGCTTCTCTGGACCCTGGCAGTGCT.C 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
8 AACAAGAACGAT GC CCCCCT C CACGAGAT CAACGGT GAT CATT T GAAGATCT GT CCCCAGGGT T CTACCTGCTGCT CT 

" m CAAGAGAT GGAGGAGAAGTACAGC CT G CAAAGT AAAGAT GAT TT GAAAAGT GT GGTCAGCGAACAGT GCAATCATTT G 

gsr.* CAAGCT GT CTT T G CTT CACGTTACAAGAAGT TT GAT GAATT CTT CAAAGAACTACTT GAAAATGCAGAGAAATCCCT G 

?$~ s AAT GAT AT GTTT GT GAAGACATAT GGCCATTT AT AC AT GCAAAATT CT GAGCTATTTAAAGAT CTCTT C GT AGAGTT G 

I . i AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
TTCCGCCT GGT GAACT C CCAGTAC CACT TTACAGAT GAGTAT CTGGAAT GT GT GAGCAAGT ATACGGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
AT GAT CT ACTGCT C CCACT GCC GGGGT CT CGT GACT GT GAAGC CAT GTTACAACTACT GCT CAAACAT CAT GAGAGG C 
T GTT TGGCCAACCAAGG GGAT CT CGATT TT GAAT GGAAC AAT TT CAT AGAT GCT AT G CT GAT GGTGGCAGAGAGGCTA 

ap GAGGGT CCTTT CAAC ATT GAAT CGGTCAT GGAT C CC AT CGAT GT GAAGATTT CT GAT GCTAT TAT GAACATGCAGGAT 

Ui 

AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 

I:* I 

frf* TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
?*% AGTTTGGACCGACT G GTT ACT GAT GT CAAGGAGAAACT GAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 

if 1 1 GTT T GCAACGAT GAGAGGAT GGCT GCAGGAAACGGCAAT GAGGAT GACT GT T GGAAT GG GAAAGGCAAAAGCAGGTAC 

CT GTTT GCAGT GACAGGAAAT GGATTAGC CAACCAGGGCAACAACC CAGAGGT C CAGGTT GACACCAGCAAACCAGAC 
ATACT GAT CCT T CGT CAAATCATGGCT C T TC GAGT GATGAC CAGCAAGAT GAAGAATGCATACAAT GGGAAC GACGT G 
GACTT CT T T GAT AT CAGTGAT GAAAGTAGT GGAGAAGGAAGT GGAAGTGGCT GT GAGTAT CAGCAGT GCC CTTCAGAG 
T TT GACTACAAT GCCACT GAC CATGCT GGGAAGAGT GC CAAT GAGAAAGC CGACAGTGCT GGT GT CCGTCCT GGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAG ATAAT TCTCAAACTCTGAG 
AAAAAGT GT T CATCAAAAAGT TAAAAGGC AC C AGTT ATC ACT TT T CT ACCAT CCTAGT GACT TT GCTTT TTAAAT GAA 
T GGACAAC AAT GTACAGTT TT T ACTATGT GGCCACT GGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AG GAAAAGGGACT GT GCAT T GAGTT GGTT CCT GCT C CCC CAAAC CAT GT TAAAC GT GGCTAACAGT GTAGGT ACAGAA 
CTAT AGT TAGT T GT G CATT T GT GATT TT AT CACT CTATT ATT T GT T T GTAT GTT TT TTT CT CATT TC GTTT GT G GGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCTGAAATATTAAATAGCT GTACAGAAGC AG GTTT TATTTAT CAT GT TAT CT TATTAAAAGAAAAAG CC CAAAAAGC 



y i 
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FIGURE 24 

MARFGLPALLCTIAVLSAAL 

CCS QEMEEKY S LQS KDD FKS WS EQCNHLQAVFASRYKKFDE FFKELLENAEKS LNDMFVKT YGH 
LYMQNSELFKDLFVELKRYYWGNTOLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDA^ 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWKGKGKSRYLFAVTGNGIANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADS AGVRPGAQAYLLT VFC I L FLVMQREWR 

Important features : 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCT CAAAT GG GAAC GC T G GCC T GGGAC TAAAG C AT AGAC C AC C AG G C T GAG TAT C CT GAC 
C T GAGT CAT CCCCAGGGAT CAGGAGC CT CCAGCAGGGAACCT T C CAT TAT AT TC T T CAAGCAAC T 
TACAGCTGCACCGACAGTTGCG ATGA AAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAAT TCATGACAGT GT CT GGGC TGCCAAAGAAGCAGTGCCCC T GTGAT CATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAG GAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC TAG TGTCAT T T AACCT T AAAT GCAAT CAGGAAAG T AGCAAACAGAAGT CAATAAATATT T T T 
AAATGT CAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVL I SS LLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCCGTGC TGAC TCCCAAGAGCAACCGCAAGAT GGAGAGCAAGAAAC GGGAGC TC TTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
Uh GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

0 CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGAGCGGGAGGTGTGC 
|;f TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
Z$ GAGCCCACCT GGAAACACAT TGGGGAT GGCT GC T GCCT CACCAGAGAGACCT GGAAGGATCT TGA 
|ll GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 

10 GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

1 Irs 

T T CCCCAGCCTCCAAT TAGAACAAGCCACC CACCAGCCTAT CTATCT T CCACTGAGAGGGACC TA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
in AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
O AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
vy CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 

i;f! CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

111 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACT CAAAGAGGCAGAGGT TTTGTTCT CAAAT AT T T T T TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHIMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDYWCTLVLCSVQSPRKVXQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 



Leucine zipper pattern. 



amino acids 10-32 



Hi 
111 



N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TATGATTCTTATACTAATTTATACAAAGATAT TAAGGCCCTGT TCAT TAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 

l£ Growth factor and cytokines receptors family: 

£.1 amino acids 3-18 

0 

m 
w 
m 
m 
m 

.5 

0 
If! 

a- 
«i 
o 
m 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
T T CAGAT T CATC ACCAC CC T T C T GGT T CACAT T T TCAT T T CAT T GGT TAT T T TGGGATT GT T G T T 
TGTCTGCGGTGTTT TAT GG T GGC TGTAT TAT GACTATACCAACGACC T CAGCATAGAAT T GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
H CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
M AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
2 TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 

y s 

y j ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
pi AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
Hi T GG T T AC T T GT T AT T T CAACAGAAGT AAAAAT GAT C C TCCT GATCATCC CAT CCTTTCGTCTCTC 

WS TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
m GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
fil CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
Q AACCAGAATGGkTATACTACAACTGCTATTAATGGGA^ 

5 ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
J:^ TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAAC AAATGAT GGAT CGT CAGAAAAGCCC TAC T T TAT GGAT CAAGAAT T T C T GAGT T T CGTA 
AAAAGGAGCAACAAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCACT CAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCAT TGTGAGATAGATACCCATT TAGGTATCT GTACC TGGAAAACAT T 
TCC TTCTAAGAGCCATT TACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTT T T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGV 

LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
I L I FFWYLWVAVLLSLGTAGAAQVMEGGQVE YKPL S G IRYMWS YHL I GL I WT SE FI LACQQMT IA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GT T CGAT TAGCTCC TC T GAGAAGAAGAGAAAAGG T TC T T GGACC TCTCCCTGTTTCT TC CT T AGA 

ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 

GAAAAT TT T T TGAAAAAAAAAT TGCCT T C T T CAAACAAGGGT GTCAT T C T GAT AT T TATGAGGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 

ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 

GATGTCAAA.GCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 

CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 

TACACAG TGGTGTGCTT GATAAT TCAGGAGGGAAAAT AC T T G T TC GGAAGGT T GCT GGACAGT CT 

GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 

TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 

CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 

ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GC CACAGGAGCCAGGAGAT GGATC TCT GGT CCACT GCC ACC T ACACAAGCAGCCAAAACAGGCCC 

AGAGCT GAT CCAGG TAT CCAAAGGCAAGAT CC T TCAGGAGC T GCC T T CC AGAAACCTGT T GGAGC 

G GAT GT C AG C C T G GG AC T T G T T C C AAAAGAAGAAT T GAG CAC AC AG T C T T T G GAGC C AG TAT C C C 

TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGG TCCACT GAT GGGT G T T GT CCAGTAT GGAGACAACCC TGC T AC TCACT TT AACC T CAAGACAC 

ACACGAAT T C TCGAGAT CT GAAGACAGCCATAGAGAAAATT AC TCAGAGAGGAGGACT T T CTAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 

TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GT CCAAGCCCAACAAGAGGAAGT TAAT GATCC TCAT CAC CGACGGGAGGT CCTACGACGACGT CC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GC CCAAGAGGAGCTAGAAGT CAT TGCCACTCACCC CGC CAGAGACCACT CCT T C T T T GT GGACGA 

GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 

AGC C TCGGAAC TGA AT T CAGAGCAGGCAGAGCACCAGCAAGT GCT GC TT TACT AAC T GACG TGT T 

GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

TGAT CACAAACGTATAGAAT GAGC CAAAAGGC TACAT CATGT T GAGGGT GCT GGAGAT T T TACAT 

T TT GACAAT T G T T T T C AAAAT AAAT G T T C G GAAT ACAG T GC AGCC C T T AC GACAGGC T TAC G TAG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

T T T TGT CAT GACAATGTAGGAAT T GC TGAAT TAAAT GT T T AGAAGGAT GAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKI IDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ES FIVLESKPKKGVTYPSALT YS S SKS PAAQAGETTKAYQRPPI PGTTAQPVTLMQLLAVTVAVA 
T PT TLPRPS PSAAS T T S I PRP QS VGHRS QEMDLWS TAT YT S S QNRPRAD PG I QRQDP S GAAFQKP 
VGADVSLGLVPKEEL S TQS LE P VS LGDPNCKI DLS FL I DGS T S IGKRRFRI QKQLLADVAQALD I 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKI TQRGGLSNVGRAI S FVTKNFFSKANGN 
RSGAP1SF\AAA/MVT)GWPTDKV^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI LIT DGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELEV I AT HPARDH S F F 
VDE FDNLHQ YVPR IIQNICTEFNS QPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 



III N-glycosylation sites. 

amino acids 390-394, 520-524 



N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 



Amidation site. 

amino acids 304-308 
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FIGURE 35 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT CAGCCT TC CCCACT AC AATGTG AT AG AACGCGT G AACT GG ATGT AC TT CT AT GAGT AT GAGCCGATT T A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
jU TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
Q TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
C»f ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 
«M ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 

is! 

2* AAGAC AGACACTGATGTTTTCAT CAATACTGGCAATTT AGT GAAGT AT CT T T T AAAC CT AAACC ACTCAGA 

GAAGT TTT TC AC AG GT TATCCT CT AATT GAT AAT T ATT CC TAT AGAGGAT T T T ACC AAAAAACCCATATT T 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 

Iff 

TTGGTGCC AAGGAT CT ATGAAATGAT GGGT CACGT AAAAC CC AT CAAGT T T GAAGATGTT T ATGTCGGGAT 
CT GT TTGAAT TT AT T AAAAG TG AAC ATT CATAT T CC AGAAGACACAAATCT T TT CT TTCT AT ATAGAATCC 
Hi AT TT GGAT GT CTGT CAACT GAGACGT GT GATT GC AGC CCATGGCTTT T CT T C CAAGGAGATC AT CACTT TT 

Q TGGCAGGTCATGCTAAGGAACACCACATGCCATTAT TAAC TTCACATTCTACAAAAAGCCTAGAAGGACAG 
CO GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
Q AC AC T GAACT GAAACT CATG AAAAAC CC AGACTG GAGACT GGAGGGT T AC ACT T GT GATT T ATT AG TC AGG 

CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CT AGGCTGT AAAAACAAAACAAT GT AGAGTTT T AT TTATT GAAC AAT GTAGT C ACT TG AAGGTT T T GT GT A 
TAT C TT AT GT GGAT T ACC AATT TAAAAAT AT ATGT AGT TCT GTGT C AAAAAACT TCTT CACT GAAGTT AT A 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
TGAATCATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCACTCCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAG AAT T AAAGC AAG AAAAT C T G AAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLL^ 

TLREHSNCSHQNPFLYILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site . 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGG &gGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCGTTCTCATC 
TCG TCCTTGCCAAGAGAGTAC ACAGTC AT TAATGAAGCC T GCCCTGGAGC AGAGT GGAATATCATGTGTCGGGAG TGC TGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
T GAGTGTGACT C C TGCC TGATCCACCC AGGT T GTAC CATC T TTGAAAAC TGCAAGAGC TGCCGAAATGGCT C AT GGGGGGGT 
ACC T T GGATGACT TC TATGTGAAGGGGT TCTAC TGTGCAGAGT GC CGAGCAGGC TGGTACGGAGGAGAC TGCATGC GATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGC TCC TC AT C CCC TT GT T TCCATGACGGC ACGTGC GTCC TTGACAAGGC T GGATCTTAC AAGTGTGCCTGC TT GGCAGGC 
TATACTGGG<:AGCGCTGTGAAAATCTCCTTGAAGAAAGKAACTGCTCAGACCCTGGGGGCCCAGTCAATGGGTACCAGAAAA 
T AAC AGGGGGC CC TGGGCT TATCAACGGACGC C ATGC TAAAAT TGGC ACC GT GGTGTC T TTCT T T TGT AACAACT C CTATGT 
T CT TAGTGGCAATGAGAAAAGAAC T TGCC AGCAGAAT GGAGAGTGGTC AGGGAAACAGCC CATC TGC AT AAAAGC C TGCCGA 
GAACCAAAGAT T TCAGACC TGGTGAGAAGGAGAGT T C TTC CGATGCAGGTT C AGTCAAGGGAGAC ACC ATT AC AC CAGC TAT 
ACTCAGCGGCCT TCAGC AAGCAGAAACTGCAGAGT GC CCC TAC C AAGAAGC C AGC CC T TC CCT T TGGAGATC TGC C CATGGG 
ATAC C AACATC TGCATACC CAGC TCCAGTATGAGTGC ATCTCACC C T TC TACC GC CGC C T GGGCAGC AGCAGGAGGACATGT 
C TGAGGAC TGGGAAGT GGAGTGGGC GGGCAC CATCC TGC ATCCC TATC TGC GGGAAAAT TGAGAACATCAC TGCT C CAAAGA 
C CCAAGGGT TGCGCTGGCCGTGGC AGGCAGC CATC TACAGGAGGACCAGCGGGGT GCATGAC GGC AGCC TAC ACAAGGGAGC 
_ Si . GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
%f GTC ACCATGAT CAAGACAGC AGACCTGAAAGT TGTT T TGGGGAAATT C TACC GGGATGATGACCGGGAT GAGAAGACC AT C C 

AGAGCCTAC AGAT TTC T GC TATCAT TC TGCATCCCAAC TAT GACCCC ATCC TGCT TGATGC TGAC ATCGCCAT CCT GAAGC T 
U C CTAGACAAGGC C CGTATCAGCACCCGAGTCC AGCC CAT CT GCC TCGC TGC C AGT CGGGATCTC AGC AC TT C C TTC CAGGAG 

m 

v\& T CCCACATC AC T GTGGC TGGC TGGAATGTCC T GGC AGAC GT GAGGAGC CC TGGCT TC AAGAACGACACACTGCGC TCTGGGG 

I J T GGTCAGTGTGGT GGAC TCGC TGC TGTGTGAGGAGCAGC AT GAGGACC AT GGCATC C CAGTGAGTGTC ACTGATAACATGTT 

ill CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
C GAGCATC TCC TGAGC CACGC TGGCATC TGATGGGAC TGGTCAGC TGGAGC TATGATAAAAC ATGCAGCCACAGGC TCTCC A 
C TGCCT TC ACCAAGGT GC TGCCT T T TAAAGAC TGGAT TGAAAGAAATATGAA ATGAA CCATGC TCATGCAC T C CTT GAGAAG 
T GT T TC TG TATATCC GT C TGTACGTGTGTCAT TGC GTGAAGCAGT GTGGGC C T GAAGTGTGAT T TGGCC TGT GAAC TTGGC T 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TAC T AGGACAGCCAATTGGAAGAT GCCAGGGCTT GC AAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAACC T C TCCA 
C TCCACTGACC TGGTGGTCT T CCC C AAC T T T CAGT T ATACGAATGCC ATCAGC TT GACCAGGGAAGATC TGGGCTTC ATGAG 
GCC C CTT T TGAGGCT C TCAAGT TC TAGAGAGC TGCC TGTGGGAC AGCC CAGGGCAGC AGAGC TGGGATGTGG T GC AT GC C T T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAAC T AC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 28 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEWT I HAKPGFVIQLRFVMLS LE FDYMCQYDYVEVRDGDNRDGQI I 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYT GQRCENLLEERNCSDPGGPVNGYQKI TGGPGL I NGRHAK I GT WS FFCNNS YVLS GNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYEC I SPFYRRLGS SRRTCLRTGKWSGRAPSC I P I CGKIENITAP 
jU KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
P KWLGKFYRDDDRDEKT I QSLQI SAI I LHPNYDP ILLDADI AI LKLLDKARI S TRVQP I CLAASR 

J! DLSTSFQESHITVAGWWIADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTD1SMFCA 
T'l S WE P TAP S D I C TAE T G G I AAVS FP GRAS PE PRWH LMG L VS W S YDK T C S HRL S TAF T KVL P FKDW I 

5 g s 

ff% ERNMK ' 

if Sri 

Ul Important features of the protein: 

3 

Signal peptide: 

o 

SjR amino acids 1-23 

C) EGF-like domain cysteine pattern signature. 

|;(| amino acids 260-272 

W N-glycosylation sites. 

" y amino acids 96-100, 279-283, 316-320, 451-455, 614-618 

N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAA.TCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGA ATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCT CC C ACTAC GGCCT GACCAAAGAT AGGAAGAGGC GCT CACAAGATGGCT GT C CAGAC G GCT GT GCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCTGC CTAC GT GTCCT CGG CAGAGGAC GGGCAGC CAGCAAT CAGC C CAGT GGAC T CT GGC CGGAGCAACC GAACTA 
GGG CACG GCCCTTTGAGAGAT C CACT ATTAGAAGCAGAT CAT TTAAAAAAATAAAT C GAGCTTT GAGT GTT CT T CGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
5 : T CT T TC CAAGGTT GTAC CACCT GATT C CAGAT GGT GAAATTAC CAGCAT CAAGAT CAAT C GAGT AGATCCCAGT GAAA 

GCCT CT CT ATTAGGCT GGT GGGAGGTAGC GAAACCC CACT GGT C CAT AT CAT T AT CCAACACAT TT ATCGTGAT GGGG 
■ J»j T GAT CGCCAGAGACGGCCGGCTACTGC CAGGAGACATCATT CT AAAG GT CAAC GGGATGGACAT CAGCAAT GTCCCT C 

Iff | ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
3 „ I G CAGGAACAATGGACAGGC CCC GGATGCCTACAGACCCC GAGAT GACAGCT T TCAT GTGATTCT CAACAAAAGTAGC C 

ffl CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
fjrj GTGT GGCATAT CGACAT GGT CAGCTT GAGGAGAAT GACC GT GTGT TAGC CAT CAATGGACATGAT CT TCGATAT GGC A 

|f| GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
. s GGAGCCCT GACATCTTT CAGGAAGCCGGCTGGAACAG CAAT GGCAGCTGGT C C CCAGGGC CAGGGGAGAGGAGCAACA 

j»| CT CCCAAGCC CCTCCAT CCTACAATTACTT GT CAT GAGAAG GT GGTAAATAT CCAAAAAGAC CCCGGT GAAT CT CT CG 

gH GCAT GACC GT CGCAGGGGGAGCAT CACATAGAGAAT G GGATT TGC C TAT CT ATGT CATCAGTGT T GAGC C CGGAG GAG 

tfi ■ 

i?* f 3 T CATAAGCAGAGAT GGAAGAATAAAAACAGGT GACAT TTT GT TGAAT GT GGATGGGGT C GAACT GACAGAGGTCAGC C 

fif| GGAGT GAGGCAGT GGCATT AT T GAAAAGAACATC AT C CT CGAT AGTACT C AAAGCT TTGGAAGT CAAAGAGTAT GAGC 

p r S C C CAGGAAGACT GCAGCAGC CCAGCAG CC CT GGACT CCAAC CACAACAT GG C CCCACCCAGT GACT GGT C CCCAT CCT 

f ft 5 GGGT CAT GT GGCTGGAATTACCACGGT GCTT GTAT AACT GTAAAGAT AT T GTAT TACGAAGAAACACAGCTGGAAGT C 

T GGGCTT CTGCATT GTAGGAGGTTAT GAAGAATACAAT GGAAACAAACCTTT TT T CATCAAATC CATTGTTGAAGGAA 
CAC CAGCAT ACAAT GATGGAAGAATTAGAT GT GGT GAT ATT CTT CTT GCT GT CAAT GGT AGAAGTACAT CAGGAATGA 
TACATGCTT GCTTGGCAAGACT GCT GAAAGAACTTAAAG GAAGAATT ACT CT AACTATT GTTTC TT G GCCT GGCACTT 
TT TTATAGAAT CAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAATAG GCTAAGAAGT T GAAACACTATATTT AT C 
TTGT CAGTT TTTATAT TTAAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGTAT GAT CAT CTAAT GAAAGCCAGTT 
ACACC T CAGAAAATAT GAT TCCAAAAAAAT TAAAACTACTAGTT TTT TTT CAGT GT GGAGGATTT CT CATTACT CTAC 
AACAT T GTT T ATAT TT TTT CTATT CAATAAAAAGCC CTAAAACAACTAAAATGATT GATTTGTATACCCCACTGAAT T 
C AAG CT GAT TT AAATT TAAAATTT G GTATAT GCT GAAGT CT GCCAAGGGTACAT TAT GGC CATTTT TAATTTACAGCT 
AAAATATTT TTTAAAAT GCATT GCT GAGAAACGTT GCTT T CATCAAAC AAGAATAAATATT TTT CAGAAGTT AAA 
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FIGURE AO 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKWGMDISOTPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVILNKSSPEEQLGIKLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVIiAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWKSNGSWSPG 
PGERSNTPKPLHPT I TCHEKWNI QKDPGESLGMTVAGGASHREWDLP I YVI SVEPGGVI SRDGR 
IKTGDILLWDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites . 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE ai 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTG CAGT GAAGC AGAGAGAT AGATAT TAT TCACGT AATAAAAAAC ATG GGCT TCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACT GT CCT TCT GTGTCT CCTTACCT CAGAGGCCAGAGCAAGCT CATTTT CAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCAGCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCT GCAT CCCTTCCTGCAGAGGCAGCAGCT GGAT TATGGCAT CTACGTCAT CCACCAGGCT GAAGG 
TAAAAAGT TTAAT CGAGCCAAACT CTTGAATGTGGGCTAT CTAGAAGCCCT CAAGGAAGAAAAT TGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGT GGGT AAATATACAATGGTCTTCCACAC TAGAGACAAAGGCAAT GAGGTGAACGCAGAACG 
GATGAAGCT CTT ACACCAAGTGTCACGAGTCT GGAGAACAGATGGGT TGAGTAGTTGTTCT TATAAAT 
TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CAT GAGGGTTAAATAT TGTAATATGGAT ACTT GAAGGACT T TATATAAAAGGATGAC TCAAAGGATAA 
AATGAACGCTAT TTGAGGACT CTGGTTGAAGGAGATT TATT TAAATT T GAAGTAATATAT TATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGT CTGAGAGAACCAGAGTTGT TCTCGT CCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAAT CAGGACACAGT GAACTT GGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACC T GCCCAGGTATGCCTTCCAGT GAT GCCCACCAGAGAATACAT TCTC TAT TAGT 
TTTTAAAGAGTT T TT GTAAAATGAT TTTGTACAAGTAGGATATGAATT AGCAGTT TACAAGTTTACAT 
ATTAACTAATAATAAATAT GT CT AT CAAATACCTCT GTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MG FNLT FHLS YKFRLLLLLTLCL T WGWAT SNYFVGAI QE I PKAKE FMANFHKTL I LGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNP L Y INI T VD FW FG A 



Important features: 
Signal peptide: 

p% amino acids 1-27 



If! 



N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 



p amino acids 191-202 

'I I 
-ass. 

S 5 



a 
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FIGURE A3 

GC TCAAGACCCAGCAG T GGGAC AGCCAGACAGACGGCACG ATGG CACT GAGC TCCCAGAT CT GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAAC TT GCAGAGCT GCAACCCCAGGACAGAGC TGGAGCCAGG GCCAGCT GGAT GCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCA?U\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MM.SSQIWAACLLLLLLIASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC T TCAACACAAC C CCT C TTGTCACCAT ACAGCCAGAAGGGGGCACTAT CATAGTGACCCAAA 
AT CGTAAT AGGGAGAGAGTAGAC T TCCCAGATGGAGGC TACT CCC T GAAGC T CAGCAAACT GAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAAT GGCAC CT GT GT GACCAAT CTGACATGC TGCAT GGAACAT GGGGAAGAGGATG TGAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
G T T T C T GAAGAGAGAGAGAC AAGAAGAG T AC AT T GAAGAGAAGAAGAGAG T G GAC AT T T G T C G GG 



*»« 

in AAAC TCCTAACATAT GCCCCCAT TCT GGAGAGAACACAGAGTACGACACAAT CCC TCACACT AAT 

$ AGAACAAT CC TAAAGGAAGAT CCAGCAAATACGGT TTACTC CAC T GTGGAAATACCGAAAAAGAT 

0 GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 

tfi TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VAI^PVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGT AAT GCCAAT T GTGAAT T T TCAT T GAAAAACAT CAG T GACAT TCAT CCAGAATCCTT CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 

u 

\^ ACACCATGGCGAGTGGCTGGAGAGCAT CTAGTT TCCACT T C GAT T C T GAAGAAAACAAACATAGG 

lit CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 

w 

f£| CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
|y T GTAGT T TAAT GGGAATAAAATG TAAG TAT CAGT AGT T T GAAAAAAAAAAA 

iri 
y 

ill 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMS LT ARKRACCNNRTGMFLS S FFS VI T VI GAL YCML I S I QALLKGPLMCNS PSNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSMDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains : 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 

i f 

p* N-glycosylation sites. 

||| amino acids 80-84, 132-136, 148-152, 163-167 

"f* cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

Q N-rayristoylation sites. 

Ifl amino acids 22-28, 54-60, 83-89, 97-103, 216-222 



ru 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAG T GAGGAACC TGCAGAGTCACACAGT TGC T GACCAAT T GAGC TG T GAGCC TG GAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 



Pi 



m 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAYLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE fU 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AG GAGACCC T GG T GGGAG GAAGAC AC T C T GGAGAGAGAG GG GG C T GGG C AG AGATGAAG T T C C AG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

C T TC AGAGGACAGGGAGT T TC CAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTC CT T G 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

GCATCCCGTGACC TCCAGACAAGGAGCCACC AGAT TGGAT GGGAGC CC CCAC AC TCCCTCCT T AA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAZW^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE S2 



■• :i K 



- w 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 



U amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE sa 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
ps GAAGTCCTATATAACGATC T TCAACAAGAG T GCAAACAT CAT GC T TGACAAGTGGCAGCACCT GG 

£3 C CT CAGAGGGCAGCAG T CGT CTGGACAT GT T TGAGCACATCAGCC T CAT GAC C T TGGACAGTC TA 

IP CAGAAATGCATCT TCAGCTT TGACAGCCATTGTCAGGAGAGGCCCAGTGAATATAT TGCCACCAT 

CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
Si ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

m 

HI GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

S AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 

t»J CAT T GT CAGATGAGGAT ATAAGAGCAGAGGCTGACACCT TCAT GT T TGGAGGCCATGACACCACG 

*[j GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

Jrt ACAGGAGGTGCAAGAGC T TC T GAAGGACCGCGATCCT AAAGAGATTGAAT GGGACGACCT GGCCC 

few 

pi AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
f|| TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLLSLPWLGLRPVAMSPWLLLLLVVGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 

L I TPTEEGLKDSTQMSATYSQGFTVWLGPI I PFIVLCHPDT IRS I TNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDT TAS GL S WVLYNLARHPE YQERCRQE VQELLKDRDPKE I EWDDLAQLP FL TMCVKE S LR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 

LAFIPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAEG 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 



APP ID-1 0063685 



Page 208 



FIGURE 55 

AT CGCAT CAAT TGGGAGT ACCATC T T CCT CATGGGACCAGT GAAACAGC T GAAGCGAAT G T T T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TC T GGAT GT T G TC C CAC TGAAT TCCCAT GAATACAAAC C TAT TCAGCAACAGCAAAAAAAAAAAA 
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FIGURE *6 

MGPVKQLKRMFEPTRLIAT IWLLCFAM^ 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 



APPJD=1 0063685 



Page 210 of 322 



FIGURE 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCGGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
G GAGAT AAGGAC CT T GAT GGG CAGCTAGAC T TT GAAGAATT TGT CCATTAT CT CCAAGAT CAT GAGAAGAAGCT GAGG 

CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
G GAGTCAAGATATCT GAACAGCAGGCAGAAAAAAT T CT CAAGAG CAT GGATAAAAACGG CAC GAT GAC CAT CGACTGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGAT CATCCTCTACT GGAAGCATT CCACG 
ATCTTT GAT GT GGGT GAGAAT CTAACGGTC C CGGAT GAGTTCACAGTG GAGGAGAGGC AGAC G GG GAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCATGCCT CCCGCAGCAACAACAT GGGCAT CGT T GGTGGCTT CACT CAGATGATT C GAGAAGGAGG GGCCAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAG CGCC TT GT T G GTAGT GAC CAGGAGACT CT GAGGATTCACGAGAGGCTT GT GGCAGGGT C CTT GGCAGGGGC C 
ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
AT G CT GGACT GC G C CAGGAGGAT CCT GGCCAGAGAGGGGGT GG CCGCCT T C TACAAAG GCTAT GT C CCCAACAT GCTG 
GGC AT CAT CC CCTATGCCGGCAT C GACCTT GCAGT CTAC GAGAC GCT CAAGAAT GC CT G GCT GCAGC ACTATG CAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGC^GCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GT C CT GCT GACCC CAGCAGACCCTCCT GTT GGTT CCAGC GAAGACCACAGGCAT T C CTTAGGGTC CAG GGT CAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
C CT CTT G CTG CCT GCCT GTCT GCT GAGGTAAGGTGGGAGGAGGGCTACAG C CCACATC CC ACCC C CT CGT CCAAT C C C 
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGC CT GGACC CT GT CAGGATGGGCCCCACCTCAGAACCAAAC T CAC T GT CC CCACT GT GG CAT GAG GGCAGT GGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACT GTTGGGAAAAGGGT TTT GTCCAGAAGGACAAGC CGGACAAAT GAGCGACTTCT GT GCT T C CAGAGGAAGAC GAGG 

GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGAC CAGCC CCACATT CCACTT GT GT CAC T GCTT G GAAC CTATTTATTTT GTAT TTATTT GAACAGAGTTAT GT CCT 
AACTATTTT TATAGAT TT GTTTAATTAATAGCTTGT CATT T T CAAGTT CATT TTT T ATTCATATT TAT GT T CATGGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTT GGGGAAGG GCT T GC C CC CAGCCTTAGGATT TCAGG GTTT GACT GGGGGC GTGGAGAGAGAGG GAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAAT CTTCAATAGGATGCAAAGATCAATGCAAAAATTGTTATATATGAACATATAACT GGAGTCGTCAAAAAG 
CAAATTAAGAAAGAATTGGACGTTAGAAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT IDWNEWRDYHLLHPVENI PE I ILYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKGYVPNMLG I 1 P YAG I DLAVYE TLKNAWLQH YAVNS ADPGVFVLLACGTMS S T C 
GQLASYPLALVRTRMQAQAS IEGAPEVTMSSLFKHI LRTEGAFGLYRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE so 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACT CAT CAT T GGC T T TGGT AT T T C AGGGAGACAC TC CAT CACAGTCAC T ACT GTCGCC T CAGC T 
GGGAACAT T GGGGAGGAT GGAATCC T GAGCT GCAC T T T T GAACC TGACAT CAAAC T T T CT GATAT 
CG T GATACAATGGC TGAAGGAAGGTGTT T TAGGCT TGGT CCAT GAGT T CAAAGAAGGCAAAGATG 
AGC TGT CGGAGCAGGAT GAAATGT TCAGAGGCCGGACAGCAGT G T T T GCT GAT CAAGT GAT AGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
J* CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
pi ^AGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
§! CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
III CAGCTTTGAGCTGAACTCTGAGAA.TGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATAC TC C T GTAT GAT TGAAAAT GACATT GCCAAAGCAACAGGGGATAT CAAAGT G 

m 

?Z ACAGAAT CGGAGATCAAAAGGCGGAG TCACC TACAGCT GC T AAACTCAAAGGC T TCTCTGTGTGT 

J" CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
f1 GT GCCT TGGC CACAAAAAAGCATGCAAAGTCAT TGT TACAACAGGGATC TACAGAAC TATT TCAC 

ill CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 
j| AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGC TC CAATAT GAACAAGATAAA.T 

Sj. CTATC T T CAAAGAC AT AT TAGAAGT TGGGAAAATAAT T CATG TGAAC TAGACAAG TGT GT TAAGA 

J|| GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGT GCATGT T C T T T GT CT CT GAAT T TT TAGT TAT AT GT GC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGAT TTTATAAATAAACTGAGCACCT TCTT TT TAAA.CAAAAAAAAAA?\A?W\Z\AAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DI VI QWLKE GVLGLVHE FKEGKDEL SEQDEMFRGRTAVFADQVI VGNAS LRLKNVQLTDAGT YKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 

NTSFELNSENVTMPCWSVLYNVTI1MTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALL P L S PYLMLK 

Important features : 
Signal peptide: 

amino acids 1-2 8 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 



Transmembrane domain: 



amino acids 258-281 



01 




N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 



APP ID=1 0063685 



Page 214 of 322 



FIGURE 61 



T GACGT CAGAAT CACCATGGCCAGC TAT CCT TACC GGCAGGGCT GC CCAGGAGCT GCAG GACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGGCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GG T GGACT CAGAT CACAGT GGC TATAT CTC CATGAAGGAGCTAAAGCAGGCC C TGGTCAACT GCA 
AT TGGTCTT CATTCAAT GAT GAGACC T GCCTCATGAT GAT AAAC AT G T T T GACAAG AC CAAG T CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
pi CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
W AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
1^ GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
?n GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
- CCATGACAGC TT C TCGGAT GC TATGACCCAACCAT CT GT GGAGAGTGGAGT G CACCAGGGACC T T 

0 TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
®* ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
25 AATAGT GAGGACCGGGGCT GAGGCCACACAGATAGGGGCCT GAT GGAGGAGAGGATAGAAGT T GA 

ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 



1 y 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
S G Y I SMKE LKQALVNCNWS S FNDE TC LMMI NM FDKTKS GRI DVYG FS AL WK F I QQWKNL FQQ YDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FE D FVTMT AS RML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6* 

CAGGATCCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCGCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCT GGGGAAATAT TAATCGGTATCCAGGAGGCAGC T GGGGGAATAT TAAT CGG TATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
| TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
1 T GC TCATC TCCAATAAAAT AAAAGC AC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

I AAAAASAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^^ 
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FIGURE 6 d. 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDIARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPJ^SWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSL1HRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRY.PGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 



amino acids 1-26 

m 
~$ - 

ty Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

s k;s 
S H 5 

N-myristoylation sites. 

0 amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

if! 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 

pi 
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FIGURE 6 s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATGG GGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGGTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 . 

N-glycosylation site. 

amino acids 37-41 

N-myr i s toyl a ti on si tes . 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

ACGGACCGAGGG T TCGAGGGAGGGACACGGACCAGGAACC T GAGC TAGGTCAAAGACGC C CGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
m GCCTGCCCATCTAGGTCCCCT'CTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
B1 GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
W AGAC TC TGCCCC T GAGGTCAAGAGAGGAT GGGGC TAT T CACT TT TAT AT AT T TATAT AAAATT AG 

J* TAGTGAGATGTAAAAAAAAAAAAAAAAAA 

m 

* 

If! 
■»? - 

o 

ill 
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FIGURE 68 



MANPGLGLLLALGLPFLIARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLIAVGLALLTOKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 8 6-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACT AGAGAG GAACA ATG GG GTT ATT CAGAG GTT TT GT TT TC CT CTTAGTT CTGTGCCTGCT GCACCAG 
T CAAAT ACT TC CTT CATT AAGCT GAAT AATAAT GGCT TT GAAGATAT TG TCATT GTTATAGATCCTAGT GT G CCAGAA 
GAT GAAAAAATAAT TGAACAAATAGAGGATAT GGT GACTACAGCTT CT AC GT AC CT GTT TGAAGC CACAGAAAAAAGA 
TTTT TTT T CAAAAAT GTAT CT ATATTAATTC CTGAGAATT GGAAGGAAAAT CCT CAGTACAAAAGGC CAAAACATGAA 
AAC CATAAACAT GCTGAT GTTATAGTT GCACCAC CTACACT C CCAGGT AGAGAT GAAC CATACACCAAGCAGTT CACA 
GAAT GT G GAGAGAAAGGCGAAT ACATT CACTT CACC CCT GACCTTCT AC TT GGAAAAAAACAAAAT GAATAT GGACCA 
C CAGGCAAACT GTT TGT CCAT GAGT G GGCTCAC CT C CGGTGGGGAGT GTTT GAT GAGTACAATGAAGAT CAGC CTTT C 
T AC CGT GCTAAGT CAAAAAAAAT CGAAGCAACAAGGT GTT CC GCAGGT ATCT CT GGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GTCTTAGTAGAGCAT GCAGAATT GATT CTACAACAAAACTGTAT GGAAAAGAT T GT CAATT CTTT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATGCAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAA 
AAAACC CATAATCAAGAAGCT C CAAG C CTACAAAACATAAAGT G CAATTTTAGAAGTACAT GGGAGGTGATTAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCT GGGTGGGGAT GGTT CACTTTGATAGTACTGCCACTATTGTA 
AATAAGCTAAT CCAAATAAAAAGCAGT GAT GAAAGAAACACACT CAT GGCAGGATT ACCTACATAT C CT CT GGGAGGA 
ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA 
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
TT TATT GCTTT GGGAAGAGCT GCT GAT GAAG CAGT AATAGAGAT GAGCAAGATAACAGGAGGAAGT CAT TTTTATGTT 
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
T CC CTT CAGCT C GAAAGTAAGGGATTAACACT GAATAGTAAT GC CT GGATGAACGACACT GT CATAATT GATAGTACA 
GTGGGAAAGGACAC GT T CT TT CT CAT CACAT GGAACAGT CT GC C T C CCAGTATTTCT CT CT GGGATC CCAGT GGAACA 
ATAAT GGAAAATTT CACAGTGGAT GCAACTT C CAAAAT G GC CTATCT CAGT ATT CCAGGAACT GCAAAGGT GG GCACT 
T G GGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACAT TAACT ATTACAGT AACT T CT CGAGCAGCAAATTCTTCT 
GT GCCT CCAAT CACAGT GAAT GCTAAAAT GAATAAGGAC GTAAACAGTT TC C CCAG CCCAAT GATTGT T TACGCAGAA 
ATT CTACAAGGATATGTACCT GTT CT TGGAG C CAAT GTGACT GC TTT CATT GAATCACAGAATGGACATACAGAAGTT 
TT GGAACT TTT GGATAATGGT GCAGGCGCTGATT CT TT CAAGAATGATGGAGT CTACT CCAGGTATTTTACAGCATAT 
ACAGAAAAT GGCAGATATAGCTTAAAAGT TC GGGCT CAT GGAGGAGCAAACAC T GC CAGGCT AAAATTACGG C CT CCA 
CT GAATAGAGCCGC GTACATAC CAGGCTGGGTAGT GAAC GGGGAAATTGAAGCAAACC CGCCAAGACCT GAAAT TGAT 
GAGGATACT CAGAC CACCT TGGAGGATTT CAGCCGAACAGC ATC CGGAGGT GCATTT GT GGTAT CACAAGT C C CAAG C 
CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCA^ 

ACATGGACAGCACCAGGAGATAATTT T GAT GTT GGAAAAGTT CAAC GTTATATCATAAGAATAAGT GCAAGTATTCTT 
GAT CTAAGAGACAGTTTTGAT GAT GCTCTTCAAGTAAATACT AC TGATC T GT CAC CAAAGGAGGCCAACT CCAAGGAA 
AGCTTT GCATT TAAACCAGAAAATAT CTCAGAAGAAAAT GCAACCC ACATATTTAT T GC CATT AAAAGTAT AGATAAA 
AGCAAT TT GACAT CAAAAGTAT CCAACAT T G CACAAGT AACT TT GTT TATC CCT CAAGCAAAT C CT GATGACAT TGAT 
CCTACACCTACT CCTACT CCTACT C CTAC T C CTGATAAAAGTCATAATT CT GGAGT TAATATTT CTACGCT G GTATT G 
T CTGTGATT GGGT CTGTT GTAATT GT TAACTTTATTTT AAGT AC CAC CATTTGAACCT TAACGAAGAAAAAAAT CTT C 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATT T CT GAAT CT TAAAATT CAT CCCAT 
GT GT GAT CATAAACTCATAAAAATAATT TTAAGAT GT C G GAAAAGGAT ACTT TGATTAAATAAAAACACT CATGGATA 
TGTAAAAACTGTCAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
AT C CTTT TTCATACTGATACCT GGTT GTATATTATT T GAT GCAACAGTTTT CTGAAAT GATATTT CAAATT G CAT C AA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 



iU;. 

If! 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQS IDSWEFCNEKTHNQEAP 
S LQNI KCNFRS T WE VI SNS E D FKNT I PMVT P P P P P VFS LLK I S QR I VC L VLDKS G SMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
S HFYVSDE AQNNGL I DAFGAL TS GNTDLS QKS LQLE SKGLTLNSNAWMNDTVI IDS TVGKDT FFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANS S VPPI T VNAKMNKDVNS FPSPMIVYAE I LQGYVPVLGANVTAFI E SQNGHTEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYS LKVRAHGGANTARLKLRPPLNRAAY I PGWWNGE IEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS ILDLRDSFDDALQVNTTDLSPKEANSKES FAFKPENI SEENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains : 

amino acids 284-300, 617-633 



Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



APP ID=10063685 



Page 224 of 322 



FIGURE 71 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAG CAATGAT GTAGCCACCT C CT AAC CTT CC CTT CTTGAAC CC C CAGTT AT GCCAG GATT T ACTAGAGAGT GT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACTT GCCT GAGGGT GGAC CAGAAGAAAGGAAAGGT C CC CT CTT GCT GT T GGCT GCACAT CAGGAA 
GG CT GT GAT GGGAATGAAGGT GAAAACT T GGAGATTT CACT T CAGT CAT TG C TT CT GC CT GCAAGAT CAT CCT T TAAA 
AGT AGAGAAGCT G CTCT GTGT GGT GGTTAACT C CAAGAGGCAGAACT C GTT C TAGAAGGAAAT GGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTAC GT GAGCAGCCTGAAGCGGCAGAT CG CACAGCTCAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
H AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAA^CCCAGGCCGACCTC 
f*\ CTGGCCTT C CT GCACT CGCAGGT GGACAAGGCAGAGGT GAAT GCT GGC GTC AAGCT G GCCACAGAGTAT GCAGCAGT G 

CC T T T CGATAGCTT TACT CTACAGAAGGT GTACCAGCT G GAGACT GGC CTTACCCGC C AC CC CGAGGAGAAGCCTGT G 
CI AGGAAGGACAAG C GGGAT GAGTT GGT GGAAGC CATTGAATCAG CCTT GGAGACC CT GAACAAT CCT GCAGAGAACAGC 

iff| CC CAAT CAC CGTC CTTACACGGC CTCT GATT T CATAGAAGGGAT CTAC CGAACAGAAAGG GACAAAGGGACATT GTAT 

■f I GAGCTCACCTTC^U^GGGGACCAC^AACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
W GTGAAAAAT GAAAAGCTCAACAT GGC CAACAC GCTTAT CAAT GT TAT C GTGCCT CT AGCAAAAAGGGT GGACAAGTT C 

ff% CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
^ AAAGAAGAAATAAATGAAGT CAAAGGAATACTTGAAAACACT T CCAAAGCT GCCAACTT CAGGAACTTTACCT T CAT C 

U| CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGT GAT GT GGACAT CTACTT CACAT CT GAAT T C CT CAATACGT GTAGGCT GAATACACAGC CAGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
S CAGCAGCT GGT CATAAAGAAGGAAACT GGATTTT GGAGAGACTT T G GAT T T GGGAT GACGT GTCAGTAT CGGT CAGAC 

P| TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
*Z CACAGCAACCTCATAGT GGTAC GGAC GCCTGT GC GAGGACT CTT CCACCTCT GGCAT GAGAAGCGCT GCAT GGACGAG 

II S CTGACC CCCGAGCAGTACAAGAT GTGCAT GCAGT CCAAGGCCAT GAACGAGGCATCC CAC GGCCAGCTG GGCAT GCT G 

Q GTGTTCAGGCAC GAGATAGAGGCT CAC CTTC GCAAACAGAAACAGAAGACAAGT AGCAAAAAAACATGAAC TCC CAGA 

GAAG GATT GTGGGAGACACTT TTT CT TTC CTTTT GCAATTACT GAAAGT GGCTGCAACAGAGAAAAGACTT CCATAAA 
SCl GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
f''i AAT CAAAAT CTC C GCT TT GCCT GCAAAAGTAACC CAGT T GCAC CCT GT GAAGT GT CT GACAAAG GCAGAAT GCTTGT G 

AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
1 II AATATT CAT GATT TAAGAGCAGT TTT GTAAAAAATTCATTAGCATGAAAGGCAAGCATATT T CT CCT CATAT GAAT GA 

GCCT AT CAGCAGGGCT CTAGTTTCTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAG GAGATAGGCTTATTAT GATACT 
AGT GAGTACATTAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAAT AT C GT GT CATATTTT CCC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTT TT CCCTT GT GAGTTATAGTCTGCTTATTT AATTAC CACTTT G CAAGC CTTACAAGAGAGCACAAGTT GGCCTAC 
ATTT TT ATATTTT TTAAGAAGATACT TTGAGATGCATT ATGAGAACT TTCAGTT CAAAGCAT CAAATT GAT GC CATAT 
CCAAGGACATGCCAAATGCTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT 
AAT ACAGACGTACAGATAC TTT CT CT GAAGAGTATTTT C GAAGAGGAGCAACT GAACACTGGAGGAAAAGAAAAT GAC 
ACTTTCTGCTTTAC^GAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGG GTGCT CCT CCT GGCT T CT GGCTT CCATAAGAAGAAATG GAGAAAAATATAT ATATATATATATATATTGT 
GAAAGATCAAT C CAT CT GC CAGAAT CTAGTGGGATGGAAGTTTT T G C T ACAT GT TAT CCACC CCAGGCCAGGT GGAAG 
TAACTGAATTATTTTTTAAATTAAGCAGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTT 
CAAACTATT TTTTAAAAATAAATACAGT TAACATAGAGT GGT TT CTT CATT CAT GT GAAAAT TATTAGC CAGCACCAG 
AT GCAT GAGCT AAT TAT CTCT TT GAGT CCTT GCTTCTGTTT GCT CACAGTAAACT CATT GT T T AAAAGCTT CAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
C CAT GAAT GGAAGGTGGTATT GCACAGCTAAT AAAAT AT GAT TT GT GGATAT GAA 
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FIGURE 72 



2^ 



MMMVRRGLLAWISRWVLLVLLC 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRP Y T AS D F I E G I YRT ERDKGT L YE LT FKGDHKHE FKRL I L FRP FS P IMKVKNEKLNMAW 
TLINVIVPLAKRVDKFRQFHQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGE FSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGG FDLDIKGWGGEDVHLYR 
KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACT GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATT TGTCC T GGGGAT C CA 
GAAACCCAT GATACCC TACT GAACACC GAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
T GAAGCCACAT T TGCAGAGC TCCACAT TGT ACAT TAT GACT CTGAT TCC TATGACAGCT TGAGT G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TATGAACACAT TCTGAGTCACT TGCATGAAGT CAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GC T CGCT CACAAC TCCC CCT TGCTACCAGAGTGTGC T C T GGACAGT T T T TTATAGAAGGT CCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGC ATAAA TTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 



FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFAS FI QAGS S YT T GEMLS LGVG I LVGCLCLLLAVYFI ARK IRKKRLENRKS 
WFT S AQAT TEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGG'GACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCASACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTAT'TTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AG AG AAG T G G AAG AG AAAT CC AG AAG AC CTTCCTGTTTC CAT G C AAC AAAT AT AC T C C AAT C T GA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
til AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
hi CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
ff] T CT GT GATGGGC T AT TCCAT CTACCGATATATCCACGT TGGCAAAGAGAAACACCCAGCAAAT T T 

W GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
'f2 ACTTTATCACCC TCAATATCTCGGATGATT CTAAAATT TCTCATCAGGATATGAGTT TAC TGGGA 

ill AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
;s GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
p AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
IJ1 ACAGT CAT T GAATATGAATATGAT GTCAGAACCAC TGAC AT TT GT GCGGGGCC TGAAGAGCAGGA 

5f GCT CAGT TT GCAGGAGGAGGT GT CCACACAAGGAACAT TAT TGGAGT CGCAGGCAGC GTTGGC AG 

5 TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
f|| CAGGAGCACACAGAC T C GGAGGAGGGGCCGGAGGAAGAGCCAT CGACGAC CCTGGTCGAC TGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACCTATC TCAT GCAAT T CATGGAGGAATGGGGGT TAT A 
TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAA?VCAAGTGAG 
TCACCC C T T T GAT CCCAGCCATAAAGTACC TGGGAT GAAAGAAGT T TT T T CCAG T T T GTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKI I FWYVLP IS ITVFLFSVMGYS I YRYIHVGKEKHPANLILI YGNEFDKRFFVPAEK 
I VINFI TLNI SDDSKI SHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 

5 • 

T C G T G GAGT T CCACAT GAC GAC T GAG G C C C AAGC CAC CAT C C G CAT GGAC AC C AG T GC AAG T GG C 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
m TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
|y CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
ill CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
W TCCTGC CAT CAAGGGT GACACCAT T CAGCT C TACC TGGGGGCCAAGT T GT TGGAC TCACAGGGAA 

If! 

AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
fl TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTSCTCTCTCCAGAAGA 
Ifl ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
0 TGATCAATGAAAAGGC TGCAGATAAGCT GGGATC TACCCAGAT CGT GAAGAT CC TAACTCAGGAC 

JJf ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
~H TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGGT GACCAAC T TAT ACT CAAC T TGAATAACAT CAGC TC TGAT CGGATCCAGC T G 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSi^REK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMT TE AQAT I RMDT S AS G P TRLVLS DCAT SHGS LRI QLL YKLS FLVNALAKQVMNLL VP S L PNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLI LNLNN I S SDRI QLMNSG I GW FQPDVLKN 1 1 TE 1 1 HS I LLPNQNGKLRSGVPVSLVKALG 
FEAAE S S LTKDALVLT PAS LWKPS S PVS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACT CT GAAA.TGAGGGATTAGAGGTGTT CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
5"j ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
ff| CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
UJ CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
tf 5 CAAGCCCAAG C TC T TGCCACAAGGAGC T C T C CAAGGCC T GGT CAAC CT C CC AAAGTC AAGAGTGA 

GTTGAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
~ " GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
H GT CAGAAGGT GC TGC T GAGGAT AGACT GACT T TGGCCAT TGGATT GAGCAAAGGCAGAAAT GGGG 

III GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
W TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

Hi 

GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
ifs TCCCACTGACTGACCCTCTGTGATCA?iAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACT AAT T TG TGCATGAAC T GAAAT AAAACCATCC T AC GG TAT CCAGG GAACAGAAAGC AG GAT G 
CAGGAT GGGAGGACAGGAAGGCAGCCTGGGACATT TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I Y S T LL GL PAD I QAAQAMMVTS SAI S S LAC 1 1 S WGMRC TVFCQE SRAKDRVAVAGGVF FI LGGL 
LG FI PVAWNLHG I LRDFYS PLVPDSMKFE I GEAL YLG 1 1 S SL FS L I AG 1 1 LC FS C S SQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 4 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 8l 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGGTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATAC TCC C CCGGCGAGAGC TGGCACCCCTACT T GGAGCCACAAGGCCT GATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCGACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACT GCCAGACTCC TGCT GCCAAGCCTGCAAAGAT GAGGCAAGTGAGCAAT CGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGAC T TGGTGGAGAT CTACCT CT GGAAGC TGGT AAAAGATGAGGAAACT GAGGC TCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCGCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACCTAACAGTT GCAGATATGAGCTGTATAATTGT TGTTATTATATATTAATAAA 
TAAGAAGTTGCATTACCCTCAA?W^AAAAAAAAAAAAAAAA 
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FIGURE 82 



l^PEVRVLSSLLGI^LWFPLDSHAIU^PDMFCLFHGKRYSPGESWHPYLEPQGmYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTT CPE PGCPAPLPLPDSCCQACKDEASEQS DEEDS VQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8s 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGT TT CC ATGCCGT GAGGTCCAT T CACAGAAC AC ATCC ATGG CT CT C ATGCT CAGT TT GGT T CT GAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGT T CT CT AGC G TGGTCC ACCT CT ACAGGGAC GGGAAGGACC AGCC AT TT AT GC AGAT GC CACAGT AT C 
AAGGCAGG AC AAAAC T GGT G AAGGAT TC T ATT GCGGAGGG GCGC AT CTCT CT GAGGCT GG AAAACAT T AC T 
3 . GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
f*\ ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
f s ! TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
if! ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
|tj CGCCGGGAGC AT AT CC TGTT CCATGC GGC ATGCT CATCTGAGCC GAGAGGTGG AATCC AGGGT ACAG ATAG 

CP GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
**' GAGAAGAAAGCACGGACAGGC AGAAT TGAGAG ACGCCC GG AAAC ACGC AGTGGAGGT G ACTCT GGAT C C AG 

IL AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 

3' "S 

g 5 | CC TC AC TCT G AGAAG AGATT T AC AAGGAAG AGTGT GGT GGCTTCTC AG AGTT TCCAAGCAGGGAAACAT T A 

*** CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
ffi! AGGAGT AC GT GACT TT GT CT CC CG AT CATGGG TACT GGGT CCT C AGACT G AAT GGAGAACAT TTGT AT TT C 

g| ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
fjj TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT T AT TGAGGCCCT AC AT TG AGTATC CGT C CT AT AATG AGC AAAAT GG AACT CCC AT AGT CATC TGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TG AGT CCT CCTC AC AGGC AACC ACGCCCTT CCTCCCCAGGGGTG AAAT G TAGG ATG AAT CAC AT CCCAC AT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GACATT AC AT TT AGTT TGCT CT CACTCC AT CT GGCT AAGT GATCT TGAAAT ACC ACCT CT CAGGT GAAG 
AACCGT CAGGAATT CCCATCTC ACAGGCT GTGGT GT AG AT TAAGT AGACAAGGAAT GT GAAT AATGC TT AG 
AT CTT ATTGATGAC AG AGTGT ATCCT AAT G GT TT GT TC AT TATATT AC ACT T T C AGT AAAAAAA 
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FIGURE 84 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLI S I TGYVDRDI QLLCQS SGWFPRPTAKWKGPQGQDLS TDSRTNRDMHGL FDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QS FQAGKHYWE VDGGHNKRWRVGVCRDDVDRRKE YVTLS PDHGYWVLRLNGEHL YFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
•PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain : 

amino acids 239-255 
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FIGURE 8* 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAC ATG CTGCTGCTGCTGCTGCCCCT 
GCT CT GGGGGAGGGAGAGGGC GGAAGGAC AGACAAGTAAAC TGC T GAC GATG CAGAGTTCCGT GA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCC T GAGCATCAGAGATGCCAGAAGAAGTGAT GCGGGGAGA 
T AC TTCTTTCGTAT GGAGAAAGGAAG T AT AAAAT GGAAT T AT AAACAT C AC C GGC T C T C T GT GAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
C T GAGGGAT GCAGC T GAAT T CACC T GC AGAGC T CAGAAC CCT CT CGGCT CT CAGC AGGT C T AC CT 
G AAC GTCTCCCTG C AG AGCAAAG C C AC AT C AGGAG T GAC T CAG GGGGTGGTC GG GG G AGC T GG AG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCT TGGGACTCGCGGGGACAGGAGGCCACTGAC AC CGAGTAC T CGGAGAT CAAGATCCACAG 
ATGAGAAAC T GC AGAGACT CACCC TGAT T GAGGGAT CACAGCCCCT CC AGGCAAGGGAGAAGT CA 
GAGGCT GAT T C TT GTAGAAT TAACAGCC CTCAACG T GATGAGCTATGAT AACAC TAT GAAT TAT G 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
T C C C T T T TAT T T T T T T AAC T AAAAG AC AGAC AAAT T C C T A 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQS SVTVQEGLCVHVPCS FS YPSHGW I YPGPWHGYWFREG 
ANTDQDAPVATNNPAIUVVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CC AAGGAAAG TGCAGC TGAGAC T CAGACAAGAT T AC AATGAACCAAC T C AGCT T CCTGCTGTTTC 
T CAT AGCGACCACCAGAGGAT GGAGTACAGAT GAGGC TAATAC T T AC T T CAAGGAATGGACC T G T 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGT^ATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GT TT GGCAT CTACCAGAAATAT CCAGT GAAATATGGAGAAGGAAAGT G T TGGACT GACAACGGC C 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGC GGGAAT TCAC TGCGGGAT T T GTT CAGT T CAGGGTAT T TAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACT CAT GTT GGTTACAGCAGCAGC CGT GAGATAACT GAGGCAGC T G T GC T TCTAT T C TATCG 
T TGA GAGTTT TGTGGGAGGGAACCCAGACCT CTCC TCCCAACCATGAGAT CC CAAGGATGGAGAA 
CAACTTACCCAGTAGCTAGAATGT TAAT GGCAGAAGAGAAAACAATAAAT CATATTGACTCAAGA 
AAAAAA 



FIGURE 88 

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSK7VDYPEGDGNWANYNTFGSAEAAT 
S DD YKNPG Y YDI QAKDLG I WHVPNKS PMQHWRNS S LLRYRTDT G FLQTLGHNL FG I YQKYP VKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYS P YGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHC I GGGGYFPEAS PQQCGDFSGFDWSGYGTHVGYS S SRE I TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

GlycosaminoglYcan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA. 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCT GGGATTATCCGCATT GTATT TAGTGCTT TGTAATAAAATATGTTT TGTAGTAACA 
T TAAGAC T T ATAT ACAGT T T T AGGGGACAAT T AAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAG 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
H AGT TCCACCAGGAACAGCAGATC T TCAT CGAAGCCAAGGCC T CCAAAAT CT T CAAC T GCCGGAT G 

T58K? 

CP GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
W CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
J CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
s TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
C3 AG G C AG GGAG GG GG GT GGAGACGAG GAGAT GC C AAGT GGGG C C AG G GC C AAG T C T CAAG T GG C AG 

JJ* AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
5f AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
f|| CCTTGGTTCTTGC CAT C C T GAG GAAA.G AT AG CAAC AG G GAG G G G GAGAT T T C A T C AG T G T G G ACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACAC CCAT CC T TAAGC TAAGACAGGACGAT TGTGGTCCTCC CACAC TAAGGCCACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
S VH FQHNATGQGN ISIS LVP PSKAVE FHQEQQI FI EAKAS KI FNCRME WEKVERGRRT S LC THDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGC7\AGAGTCATTATTGACAACA 
AAG AT GG AC C AAC ACAGAAAT AT C T G C T GAT C T T T GG AG CGTTTGTCTCTGTC TAT AT C C AAG AA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCC TGAAAC TC TGCCTGC T CT GCCAAGACAAGAACT T TC T T CT T TACAACCAGCGCT CCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
T T T T C T G AAAAT CCCTTTTTCTGGTG G AAT T GAGAAAG AAAT AAAAC TAT G C AGAT A 
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FIGURE Q4 

M TAAV F FG CAF I AFG P ALAL Y VFT I AI EPLRIIFL I AGAF FWL VS LL I S S LVW FMARV 1 1 DN KD G 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
T AATCCAGATGTCCAGGAT GGAAGCC T TCCAGCAGGAGGAGCAGG T GT AAAT CCT GCCAC CCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCAT C CAAAGGAGCACACAT GCCATCGAGGAAGCCACCACAGAAT CAGCAAATGGAAT T CA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TATATTAT GGAATAGAT T GAGACACAT TGGATAGTCT TAGAAGAAAT TAAT TC T TAAT T 
TAC CTGAAAATAT T C T TGAAATT TCAGAAAATAT GTT CTATGTAGAGAAT CCCAAC T T T TAAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
T AAAACAT AT T TGGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQI FTSLI IHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DDDFAVT T PAGI QRS THAI EEAT TS SANG I Q 



Signal peptide: 

amino acids 1-16 

H 

a 
o 
in , 

m 
m 
m 



a 
III 
a 

CO 

C3 

rii 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 

CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 

GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 

AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 

CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 

GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 

GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 

AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 

AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 

GGAT GAGCCAGGCACAGCCAAC TC GGAT CT T CGAT TCCACAT CC T GAGCCAGGCT CCAGCCCAGC 

CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 

ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 

CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 

CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 

CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 

T GCAGAGGGAAACC TC TACG T GACCAGAGAGCTGGACAGAGAAGCC CAGGCT GAGTACC T GCTCC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 

AT GGAT GAGAATGACAACGTGCCTAT CT GCCCTCCCCGTGACCCCACAG T CAGCAT C CC T GAGC T 

CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 

CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 

T CGAAGT CGCAGTCACAGATAT CAAT GAT CACGCCCCT GAGT TCATCACT TCCCAGATTGGGCC T 

ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 

CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 

GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 

GCAGCTCCAAG.TCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 

AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 

CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 

ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 

TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 

GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 

GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 

GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 

CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 

CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 

ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 

GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 

TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 

TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 

TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 

CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T S LDHALERT YQLLVQVKDMGDQASGHQATATVEVS 1 1 E S TWVS LEP I HLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT C CAAGATTCT TC CCAGGAACACAAACGT AGGAGAC CCACGC T CCT GGAAGCACCAGCCT T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T T CT CCT TATGT T TGGTC TACTAT T GCAT T T AGAAGCTGCAACAAATT CCAATGAGAC TAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCAC CAAC TC TGAG TTCCATACAACC TC CAGT GGGATCAGCACAGCCACCAAC T CT GAG 
T T C AG C AC AG C G T C C AG T GG GAT C AG CAT AG C C AC C AAC T C T GAG T C C AG C AC AACC T C C AG T G G 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
jU CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
7£ CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
CH GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
|l| AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
ff| GCACAGCCACCAAC TC T GAC TCCAGCACAGT GTCCAGTGGGGCCAGCACT GCCACCAAC TCTGAG 

:^ TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
W GGC TAGCACAGCCACCAACTCTGAC TCCAGCACAACC T CCAGTGGGGCC GGCACAGCCAC CAAC T 

If! CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
g AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
f«J CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
?S CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAAC T C T GAC TCCAGCACAAC C T CCAGTGAGGCCAGCACAGCCACCAAC T C TGAGT C TAG 
O CACAG T GTC CAGT GGGATCAGCACAGTCACCAAT TCTGAGT CCAGCACAACCT CCAGTGGGGCCA 

0| ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
p| ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
J?]- - GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
5 M TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCC TGA GCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT CTTGAAGAAGGTATTCC TCACCTT TCTT GCCTT TACCAGACACTGGAAAGAGAATACTATAT 
T GC T CAT T T AGC T AAGAAAT AAAT AC AT C T CAT C T AAC AC AC AC G AC AAAG AG AAG CTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CTGGCAT TCAAAAT C TCCACAGTAAAAT CCAAAGAC CTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNS'EFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGAJSfTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 

CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 

GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 

GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 

AAC GAG GCAGT T T C T AGC T AC T G C AC G TACT T CAT AAAGCAG G AC T C T AAAAG C T T T GGAAT CAT 

GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 

TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 

CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

GAT TAT AACT GGGGAT GCAT T TG T T CC T GGAGAAAGAAGTGT CAT T ATCATGAACCATCGGACAA 

G AAT GGAC T G GATGT T C CTGT GGAAT T GCCT GAT GC GAT AT AGCT AC C T CAGAT TGGAGAAAAT T 

TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

C T T CAT T CATAGGAAAT GGAAGGATGACAAGAGCCAT T TCGAAGAC AT GAT T GAT TACT T T T GT G 

ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 

T C T CGAAGTAATGCAT TTGC TGAAAAAAATGGAC T TCAGAAATAT GAAT ATG T T TTACAT CCAAG 

AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 

ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 

CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 

GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 

CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGGAATGTGCCTACTCAT 

ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 

GAAT AT TTGGTGGACT GGAGAT CAT AGAACT TGCATGT TACCGAC T T TT ACACAAACAGC CACAT 

T T AAAT T CAAAGAAAAAT GAG TAA GAT TAT AAGGT T TGCCATGTGAAAAC CTAGAGCATAT T T TG 

GAAATGTTCT AAACC TT TCTAAGCT CAGATGCAT TT TTGCATGACTATGT CGAATAT TTCT TACT 

GCCAT CATT AT TT GT TAAAGATAT T TTGCAC T TAAT T T T GT GGGAAAAAT AT TGC TACAAT TT T T 

T T T AATC TC T GAATGTAAT T T CGATAC T GT G TAC AT AGCAGGGAGT GATC GGGGTGAAATAACT T 

GGGCCAGAATATTAT TAAACAAT CATCAGGCT T T TAAA 



FIGURE 102 



MHSRGREI WLLNPWS INEAVSSYCTYFIKQDSKS FGIMVSWKGI YFILTLFWGS FFGS I FMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVCT 

FLWNCLMR YS YLRLEKI CLKAS LKGVPGFGWAMQAAAY I F I HRKWKDDKS H FEDM I DYFCDI HE P 
LQLLI FPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDI TVA 
YPHNI PQSEKHLLQGDFPRE IHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IE LAC YRLLHKQPHLNS KKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATGCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCC ATG GCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CT ACAGAGAT GGGGAAGAC TGGGAATC TAAGCAGATGCCACAGT AT CGAGGGAGAAC TG A 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
U AGGAT T TGTC TT CAGAC TCCAGAGCAAAT GCAGATGGGT ACAGCCT GT AT GAT GT GGAGATC T CC 

n AT TATAGT C CAGGAAAATGC T GGGAGCATAT TGTGTT CCATCCACC TT GCT GAGCAGAGT CAT GA 

ill GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
y TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
IP AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG 
hi AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
m TTT CT GATCTGAAAACTGTAACCCATAGAAAAGC TCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 

%i TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
IF1 ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
s ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
r'| GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGACATGT CAGT T T GAAGGC TT GT T GAGACCC TAT AT CCAG CAT GC GAT GTAT GACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
O CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
g] CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
f§ TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
«K TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT TCCTGCCTCACAGGT GAAGAT TAAAGAGACAACGAATGTGAATCATGCTTGCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
TGCT T T GT GAGGC T C AAAGAATAAAG AG GAGGT AGGAT T TT TCACT GAT T C TATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGC7VAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CT CAT TAACACAGACACAAAAATT CTAAATAAAAT TT TAACAAATTAAACTAAACAATATAT TTA 
AAGAT GATATATAACTACTCAGTGTGGT TTGTCCCACAAATGCAGAGTT GGTTTAATATTTAAA.T 
ATCAACCAGTG TAAT T CAGCACAT TAATAAAGTAAAAAAGAAAAC CATAAAAAAAAAAAAAAA 
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MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGS LPL I S I VGYVDGGI QLLCLS SGWFPQPTAKWKGPQGQDLS S DSRANADGYSLYDVE I S I 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQG FQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVT LS PNNGYWVLRLTTEHL YFT 
FNPHFI SLPPS TPPTRVGVFLDYEGGT I S FFNTNDQS L I YTLLTCQFEGLLRPYI QHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CCTT CACAGG ACTCTT CATTGC TGGT TGGC AATGAT GT AT CGGCC AGAT GTGGT GAGG GCTAGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
T G AC AAAC T AT AT GCT GAGT TT GGCAGAGAGGCT TC T AAC AATT TT AC AG AAAT GAGCC AGAG ACTT G AAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTA^GGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTC AACAGAAGC ATGGAGT GT TGGCT C AT AT GC T GTT GATT T GT AGATTTC AC TCT ACT GAGG AT CCTG A 
AACT GT AG AT AAAAT T GT T C AACT T GTT TT AC AT GAAAAGCT GC AAGATG CT GT AGGACCC CCT AAAGT AG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACCTT CGAAAATGAAACGGGGTCT CCG GAGAAT AAT T GT CCAT GAftAAAT ACAAACACCCAT C AC A 
TG ACT AT GAT ATTT CT C T TG CAGAGC TT TCT AGC CCTGTT CC CT AC ACAAAT GC AGT AC AT AGAGT T TGT C 
TC CCT GAT GC AT CCT AT GAGTT TCAACCAGGT GATGTGAT GT T T GT GACAGGAT TT GGAGC ACT GAAAAAT 
GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGCAATGAACC 
TCAAGCT T AC AATGACGCC AT AACTCCT AGAATGT T AT GT GCTGGCT CCT TAGAAGGAAAAAC AGAT GCAT 
GC CAGGGT GACTCT GGAGGACC ACTGGT T AGT TC AGAT GCTAGAGAT AT C TGGT ACCT TGCT GG AAT AGT G 
AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
G ATT ACTTC AAAAACT GGT ATC TAA GAGAC AAAAGCCT C ATGGAAC AG AT AAC ATT TT T TTT TGTTTTTTG 
GGT GTGGAGGCC AT TT TT AGAGAT AC AGAATT GGAG AAGACTTGCAAAAC AGCT AGAT T TGACT GAT CT C A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACT CT GT CATC TGTGAGC AAT AGT T GAAACTTT AT GT AC AT AGAGAAAT AGAT AAT AC AAT ATTAC 
AT T ACAGCCT GT AT TC ATTT GT TCTC TAGAAGTT T TGT CAGAAT TT TGACTT GTTG AC AT AAAT TT GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TC AAGGT G CAGAAC AAGG AGTGAAAGAAAAT AT AAGAAGAAAAAAATCCCCT AC AT TTT AT T GGCAC AGAA 
AAGT ATT AGG TGTT TT TCT T AGTGGAAT AT T AGAAATGAT CAT ATT CATT AT GAAAGGT C AAGCAAAG AC A 
GC AGAATAC C AATC AC TTCATC AT TT AGGAAGT ATGGGAACT AAGT TAAG GAAGTC C AGAAAGAAGCC AAG 
AT AT AT CCT T AT TTT CAT T TCC AAAC AACT ACT AT GAT AAAT GT GAAGAAGATT CT GT TT TT TT GT GACCT 
AT AAT AAT T AT ACAAACTTC AT GCAATGT ACT TGT TCT AAGC AAAT T AAAGC AAAT AT T T AT TT AAC ATT G 
T T ACTGAGGAT GTCAAC AT AT AACAATAAAAT AT AAATCACCCA 
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MMYRPDVVRARKRVCWEPWVI GLVI FI SLIVLAVC IGLTVHYVRYNQKKTYNYYS TLSFTTDKLY 
AE FGRE ASNNFTEMS QRLE SMVKNAFYKS PLREE FVKS QVI KFS QQKHGVLAHMLL ICRFHS TE D 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRI IVHEKYKHPSHDYDI SLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVT L I DAT T CNE PQAYMDAI T PRMLCAG S LEGKTDACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACA ATG GCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATT^lTGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 

tggaacggtgaaaagaaatctgttgcttcttgcatggtactttggaagtttgcttgtcattttct 
gtgtagaactggcttgtggcgtttggacatatgaacaggaacttatggttccagtacaatggtca 
gatatggtcactttgaaagccaggatgacaaattatggattacctagatatcggtggcttactca 

TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAZ\ATGTATTCCTT 
T T TGAGAGGAACCAAACAAC TGCAGGT GCT GAGGT T T C TGGGAATCTCCAT T GGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
GTT GAAACCAAGCCTGTCAAGAATCTT T GAACACACATCCATGGCAAACAGCTTTAATACACACT 
T T GAGAT GGAG GAG T T ATAAA AAGAAAT G T C ACAGAAGAAAAC CAC AAAC T T G T T T TAT T GGAC T 
T GT GAAT TT T TGAGTACATAC TAT GTG T T TCAGAAATATGTAGAAAT AAAAAT GTT GCCATAAAA 
TAACACCTAAGCATATACTATTCTATGCTTTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 
ACCACC TGGACAATAAT TGATGCC CTT AAAATGCTGAAGACAGAT G TCAT ACCCAC TGTGTAGCC 
TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
TCCATGCAAACGAGTCACATATGG T GGGAC T GGAGCCATAGTAAAGGT T GAT T TAG T TCTACCAA 
C T AGTATATAAAG TAC TAAT TAAAT GCTAACATAGGAAGT TAGAAAATACTAATAAC T T T TAT T A 
CTCAGCGATCTAT TCTTCTGATGCTAAATAAATTATATATCAGAAAACT T TCAATAT TGGTGACT 
ACC T AAATGT GAT T T T T GCTGGT TAC TAAAATAT T CT TACCACT TAAAAGAGCAAGC TAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T TCAGTT CT GATAATGT TAAGAATAACCAT TATGAAAAGGAAAAT TTGT CCTGTATAGCATCAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TATT TAAATACTTAACCACTAATT TTGAAAATTACCAGTGTGATAGATAGGAATCAT TATTC 
AGAAT G T AGT CTGGTC T T TAGGAAGTAT TAATAAGAAAAT T T GCACATAACT TAGT TGAT T CAGA 
AAGGAC T TGT AT GC TGT T T T T CT CC CAAATGAAGAC TC T T T T T GACACTAAACACT T T T TAAAAA 
GC T T AT CT T TGCCTTCTC CAAACAAGAAGCAATAGT CT CCAAGTCAATAT AAAT TC TACAGAAAA 
TAGTGT TCTTTTTCTCCAGAAAAATGC TTGTGAGAATCATTAAAACATGTGACAAT TTAGAGAT T 
CT T TGT T T TAT T TCACTGAT TAATATAC T GT GGCAAAT TACACAGAT TAT TAAATT T TT T TACAA 
GAGT AT AGTATAT T TAT TTGAAAT GGGAAAAGTGCAT T T TACT GTAT T T T GT GTAT T TT GT TT AT 
TTCT CAGAATATGGAAAGAAAATTAAAATGTGTCAATAAATAT TT TCTAGAGAGTAA 
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MAREDSVKCLRCLLYALNLLFWmSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHP 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYS FLRGTKQLQVLRFLG I S IGVTQI LAMI LT I TLLWALYYDRRE PGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGTCT T T C T C GT GAGAGC CT AGAGGC C T T AAAAAAAAAAGTGCT T GAAAGAGAAG GGGACAAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAAC ACT GACCACCAGTT GGATGAGTCTCAAGGT CCTCCT CT AT GT GAC AACC ATGT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
T CAAT GG GAACT GC T GT CTCT GG AACAC CAC GGT GGAAGT C AAGGCT T GCC C T G GAGGCT ACT AT GT GT AT C GT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 
AT GAAAATGAAT GT GAGCAAAAC AACGGTGGCT GCAGTGAGATCT GT GTGAACCTC AAAAACTCCT ACCGCT GT 
GAGT GT GGGGTT GGCCGT GT GCT AAGAAGTGAT GGCAAGACT T GT GAAGAC GT T GAAG GAT GCC AC AATAAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGAT AACCACACTT GCC AAGTCCCT GT GTT GT GCAAATC AAAT GCCATTGAAGT GAACAT CCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCT C T T C TC TCT C AAGAC AT GT GGT AC AGT G GT C GAT GT GGT GAAT GACAAGAT T GT GGCCAGC AACCT CGT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGCCGAAATCAT GGGATCT TCCC ATTC ACT CT GGAGATCTT CAAGGAC AAT GAGTTT GAAGAGCCTT ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT GGAAAGCT T GGT GGAGAGC T GCT T T GC C ACCCC C ACC T CC AAGAT C GAC G AGGT CCT GAAAT AC TACCTC AT 
CCGGGAT GGCT GT GT TTC AGAT GACTC GGT AAAGC AGTAC AC ATCCCGGGAT C ACCT AGC AAAGCACT TCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
T T GGACGAGC GT T CC C GCT GT GC CCAGGGTT GCCACC GG C GAAT GC GT C GT GGGGC AGGAGGAGAGGACT CAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
C AGGT CAC AGC ACT GCT GAAC AAT GTGGCCTGGGT GGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT C AG AAGC T GGGT ATAAT AT T T C AAGT T AC AAACCC T AGAAAAAT TAAAC AGTT ACT GAAAT TAT GA 
CTT AAATACCC AATGACTCCTTAAAT AT GTAAATT ATAGTT AT ACCT T GAAATTTC AATTCAAAT GCAGACT AA 
TT AT AGGGAAT TT GGAAGT GT AT C AAT AAAAC AGT ATAT AATT TT 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPREL VGG 
LEL FLTNT SCRGVSNGTHVNIL FSLKTCGT WDVVNDKI VASNLVTGLPKQT PGSSGD FI I RT SKLLI P VT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

lal 

fs\ Important features of the protein: 

f|| Signal peptide: 

|y amino acids 1-16 

m 

rxr. 

W N-glycosylation sites. 

*M amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



N-myristoylation sites. 

ff J amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxy lati on site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTG CTGCCT T GGG GTGACAAT CTC AGCT CC AGGCT AC AGGGAGAC CGGG AGGATCAC AGAGCC AGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACT GG TTCT CT GCCT GTTTCGAC AAC T TC AC AGAAGGT CT C GCTGAGAC AGC CTGT AGGCAGAT G G 
GC TACAGC AGAGCT GT GG AG AT TGGCCC AG AC C AGG AT CT GG AT GT TGTT GAAATC AC AGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T C AGCATCCAGT AC GACAAAC AGC AC GT CT GTGG AGGG AG CATCCT GGACCCCC ACTGGGTCCT CACGGC A 
^ GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 

If! 

CT TC CC ATCCCTGGCT GT GGCC AAGATC AT CATC ATTG AAT T CAACCCCATGTACCCC AAAG AC AAT GAC A 

25 

TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGC TC ACTC C AGC CACCCC AC T CTGGAT C AT TGGAT G GGGCT T T ACG AAGCAGAATGGAGGGAA 
GATGTCT G AC AT ACTGCTGC AG GCGT CAGTCCAGGT CATTGACAGCAC AC GGT GCAAT GC AG ACGATGCGT 
ACCAGGGGGAAGT C ACCGAG AAGAT GAT GTGT GC AGGCAT CCCGGAAGGGGGT GT GGACACCT GCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CT GCGGGGGC CCGAGC ACCC CAGGAGT AT AC ACC AAGGTCTC AGCC TATC TC AACT GG AT CT AC AATGT C T 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATT GCT AAG CCAAGA&GGAAC T TTCCC AC ACTACT G AAT GG AAGCAGGCTGT C TT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GC CT ACTAGAGC AAGAAACCAGTTGT AAT AT AAAAT GC AC TGCC CT ACTG TT GGT ATGACTACCGT T ACCT 
AC TGT T GT CATTGT T ATT ACAGCT AT GGCC ACT AT TAT T AAAGAGCT GT GTAAC ATCT CT GGCAAAAAAAA 
AAAA 



re". 1 : 



m 
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MLQDPDSDQPLNS LDVKPLRKPRI PME T FRKVG I P 1 1 1 ALL S LAS 1 1 1 WVL I KVILDKYY FLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAE TACRQMGYS RAVE I GPDQDLDWE I TENSQELRMRNS SGPCLS GS LVS LHCLACGKS L 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGAT TGA CT^AGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 

III AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 

W 

HI GAAGCAGT GGGT GAGACATCACGC TGCCGGC CCAT C TAACC T T TT CAT GTCCT GCACAT CAC CT G 

hi ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
lV- CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
r! CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
in CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
1*1 TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 

in 

J]- CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
STs GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GATC T CCAAGGGCCC T TATACCC CAGGAGAC T T TGAT T TGAATT TGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGCTACT CGGGAGGC TGAGACAGGAGAAT T AC TTGAACCT GGGAGGT GAAGGAGGCT GAGACA 
GGAGAAT CAC T TCAGCCTGAGCAACACAGCGAGACTCTG TCTCAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE ha 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



u 
o 
o 
m 

w 
m 

m 
m 

CI 
in 
C 
CO 
Q 

r 
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FIGURE lis 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAAT CCT CC AGAGAAT TGT GAAGACT GT CACAT T C TAAATGCAGAAGC T T T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C T AC AG C AAT GG AGAGAAGAAG AAG AT T T AC AT GGAAAT T GAT C C T G T G AC C AG AAC T G AAAT A 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T G AAC C AG AAG AG GAAAT AG AT GAG AAT GAAGAAAT T AC C AC AAC T T T C T T T GAAC AG T C AG T G 

3—, I 

f% ATTTGGGTCCCAG(^GAAAAGCCTATTGAAAACCGAGATTTTCTT7^AAAATTCCAAAATTCTGGA 
m GATT TGTGAT AACGT GACCATGTAT T GGAT CAATCCCAC TCTAAT AT CAGT T T CTGAGT TACAAG 

lU AC T T TGAGGAGGAGGGAGAAGAT CT T CACTT TCC T GCCAACGAAAAAAAAGGGAT T GAACAAAAT 

GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
?!: AGAACT TCCAATAAATGACTATACTGAAAATGGAATAGAATT TGAT CC CAT GCT GGAT GAGAGAG 

m 

GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
fj G G C TAC T AC C CAT AT C CAT AC T G C T AC CAAG GAGGAC GAGT C AT CTGTCGTGT C ATCAT GCC T T G 

111 TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
— TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 



GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSEPEEE I DENEE I T TT FFEQS VI WVPAEKP IENRDFLKNSKI LE I CDNVTMYW INPTLI S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 



Important features of the protein: 
H Signal peptide: 

l!J amino acids 1-40 

I: Si? 
f?% 

III Transmembrane domain: 

CP amino acids 25-47 (type II) 

in 



•ST.XV 



N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites . 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 



Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 

GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
.„ . TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
f*% GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
f s | TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
ffl GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
||j TGGGTC CAAC AC C AAAAAC AAG AAGATAT ACGAT GGAGGT GGCCGC AC AG AGGACGAGGT AC AATCTT AT C 

f|! CTTCCAAGCACGACTATGTG TAA TGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
W CCCAAAAAAC AAGG AG ATCCC AT CT AGAT TT C TTCTTGCT T TT G ACTC AC AGC T GGAAGT T AGAAAAGCCT 

CGAT T TCATCTT TGGAGAGGCC AAAT GGTCT T AG CCTC AGTCT C TGTCTCTAAATATT C C ACC AT AAAAC A 

1. GCTGAGT T AT TT AT GAAT TAGAGGCT AT AGCT CACATTTT CAAT CC TCT ATT TCTTTT T T T AAAT AT AACT 

jT| 

s^s TT CT ACT C T GAT GAGAGAAT GT GGT T TT AATCTC TCTCTC AC AT TT TG AT GAT T TAGACAGACT CCCC CT C 

if I 

jn?% TT CCTCCT AGT C AAT AAAC C CATTGATGATCT AT TTCCCAGCTT AT C CCC AAGAAAACTT T T GAAAGGAAA 

fn r GAGT AGACCC AAAGAT GT T ATT TTCT GCTGTT TG AATT TT GTCT CCCC AC CCCC AACT T GGC T AGT AAT AA 

p AC ACTTACT G AAGAAGAAGC AAT AAG AG AAAG AT ATTTGT AAT CTCT CCAGCCC AT GATC T CGGTTT TCTT 

flJ ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGG TCAGAAAT T GTCCCTAGAT G AATGAGAAAAT T AT TTT T TT T AAT TT AAGTCC T AAAT ATAGT T AA 
AAT AAATAAT GT T T T AGT AAAATGAT AC ACTATCTCTGTG AAAT AGCCT C ACCCCT AC AT GT GGAT AGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT CAC ACC TGT AAT CCT AGC ACTT TGGGAGGC TGAG GAGG AAGG ATC ACT T GAGCCCAG AAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGT GGC AT ACACCT GTAGT CCC AGC AT T CCG GG AGGCT G AGGT GGGAGGAT C ACT TG AGCCC AGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AAT AAAAAAT AAAT AATGGAAC AC AGCAAGT C CT AGGAAGT AGGTT AAAACT AATT CT T T AA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLP^LQAVRAmiVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAIAGVSVFANMLVTNFWMS T ANMYT GMGGMVQT VQTR YT FGAAL FVGWVAGGLT L I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



m 

hi 



GGAAAAAC TGTTCTCTTCT GT GGC ACAGAGAAC CC T G CT T C AAAGCAGAAG T AGC AGT T C C GGAG T C C 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAAT GAGAAGGT GAAGGCTCACAT TCT GCT GACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CT TAAT TACAGGAACT GTGCAT CAGCTAT TTATGATTCTATAAGC TATTTCAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCATTAT GTACATAGAT GAGT GTAACAT TTATATCTCACATAGAGACATGCT TAT ATGGT 
*2 TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 

w 

m AT CAT G GAT AG G G T T GAAGAAGGT TAC TAT TAATT G T TT AAAAACAGC T TAGG GAT TAAT GTC C T CCA 

ni T T TAT AAT GAAGAT T AAAAT GAAG GC T T TAAT C AGC AT T G T AAAG GAAAT T GAAT GGCTTTCT GAT AT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGT GTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCT TTGCAT TCAAACTGCTT 
T T C C AGGG C TAT ACT CAGAAGAAAGAT AAAAGT GT GATC T AAGAAAAAGT GAT GGT T TT AG GAAAGT G 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 
AT AT T TTAATAAG TAT T TGAGTACAGAC T TT GAGGTT TCAT CAAT AT AAAT AAAAGAGC AGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACATT TATAT TAATAAATTGTACATT TTT CTAAT T 
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FIGURE 120 

MATHALE I AGL FLGGVGMVGTVAVTVMPQWRVSAFI ENNI WFENFWEGLWMNCVRQAN IRMQCK 
IYDSLLALSPDLQAARGIJYICAASVMSFL^^ 

MWLI PVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKS S S Y 
RYS I PSHRT TQKS YHTGKKS PS VYSRS QYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 

* AAGC AG T G T T CAT G GAG T T CAT T G AAT TAT G GC AT AG AT C T T G G G AAAAT T G C G GAG T G T AC AT T 

Si 

\ TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 

f GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 

j ATTGAAGCTATAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAATTAATATTCATCG 

| CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 

I TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 

* AT TAT T GAAGAACT ACCAAA ATAAA T GCT T TAATT TTCATT TGCTACCT CT T T TT TTAT TATGCC 
, T T GGAATGGT TCAC TTAAATGACAT T T TAAATAAG TT TATGTATACATC TGAATGAAAAGCAAAG 
I CT AAAT ATG T TTAC AGACCAAAGT GT GAT T T CACAC T GT TT TT AAAT C TAGCAT TAT TCAT TT T G 
J CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
1 CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
\ AAAAAAT ATAAAAGC TAC CAAT C T T TGTACAAT T TG TAAATGT TAAGAAT TT TT TT T ATATC T GT 

T AAAT AAAAAT TAT T T C CAACA 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCC7\AAGCAAAAGACCTAAGGACGACC TT TGAACAATACAAAGGATGGGTT TCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCGTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCT TCAAAAACT TAAGTATAAT CAAT T TAAAGGGC T CAACCAGC T CACCT GGC TATAC 
CTTGACCATAACCATATCAGCAATAT T GACGAAAATGCT TTTAAT GGAATACGCAGACTCAAAGA 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TC TT TGCT GGCAT GATCAGACTCAAAGAACTTCACCT GGAGCACAATCAAT T T TCCAAGC TCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGT CC TGGAC CT GGAGC TCC T TACAAAGGC T T GAT T T ATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGTC TAAGGGAGAATACAATT AT CT G TGCCAGT CCCAAAGAGCT GCAAGGAGTAAATG TG 
ATCGATGCAGTGAAGAACTACAGCATC TGT GGCAAAAGTACTACAGAGAGGTT TGATCTGGCCAG 
GGCTC T CCCAAAGCCGACGT T TAAGCCCAAGCTCCCCAGGCC GAAGCAT GAGAGCAAACCCCC T T 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GG C AC AG G AAAAAGAAAAG AC AG T C C C T AAAGC AAAT G AC TCC C AG CAC C C AGGAAT T T TAT G T A 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
T AACAAAT CGGGCT CCAGGGAGT GT GAGGTATGAACCAT T GT GATAAAAAGAGCT C T TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CATT TT CCTC TCATACAT AAT CAACCCAT TGAAAT T TAAAT ACCACAATCAAT GTGAAGC T T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 



APP ID=1 0063685 



Page 277 of 322 



FIGURE 124 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLS YNQLHS LGSEQ FRGLRKLLS LHLRSNS LRT I PVR I FQDCRNLELLDLGYNRI RS 
LAROTFAGMIRLKELHLEHNQFSKLNIALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKE LQGVNV I DAVKN YS I CGKS T TERFDLARALPKPT FKPKL PRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKI IAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQE FYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

P! amino acids 420-442 

W N-glycosylation sites. 

m 

IS „ amino acids 126-129, 357-360, 496-499, 504-507 



if'"'" 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-4 68 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT. 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
H TT AGCCAAT CCAACTGACC TAGTGAAGGTT CAGAT GCAAAT GGAAGGAAA&AGGAAAC TGGAAGG 

jjj* AAAACCAT T GCGAT T TCGTGG TGTACAT CAT GCAT T T GCAAAAAT C T TAGCTGAAGGAGGAATAC 

^1 GAGGGCT TT GGGCAGGC TGGG TACCCAATATACAAAGAGC AGCAC TGGT GAATATGGGAGAT T TA 

|| S AC CAC T TAT GAT AC AG T GAAACAC T AC T T G G TAT T GAAT ACAC C AC T T GAG GACAATAT CAT GAC 

Cp TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
IS T CAAAAG CAGAATAAT GAAT CAAC CAC GAGAT AAAC AAG GAAG G GGAC T T T T G T ATAAAT CAT C G 

m 

* Fi ACT GACT GC T TGATT CAGGC TGT TCAAGGT GAAGGATT CAT GAG T C T ATAT AAAGGCT T T T TACC 

s 

ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
Iff TGAGTGGAGTCAGTCCATTTTAA 

W 

til 
m 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGG IRGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 



APP ID-l 0063685 



Page 280 



FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCC ATGG CCAGGCCCGGCAT6G 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTT CAAT GT GAACGTGC T GGC CCT CAGCATC T GCACACGGGAAGCC TAC CAGTCCAT GA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT T CGCC TTCAAAC TCCACGACAAGGACCC T GAGAAGGCAGC TGCCAC C TAT GAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGAT T GGAGACATCCAGATGAGGCCCACGGAGCAGGT GAC C TAGTGACTGT GGGAGCT CC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGGCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVT GAS GG I GAAVARALVQQGLKWGCART VGN I E E LAAECKS AG YPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTL^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 



Important features of the protein: 
Signal peptide: 

amino acids 1-17 



N-myristoylation sites. 

yy amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 

m 199-205 

U| 

■SJH 

Short-chain alcohol dehyrogenase. 

Q amino acids 30-42, 104-114 

III 

Hi 
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FIGURE 12Q 



AACTTCTAC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAA ATGA GGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAAT T TT TAAC T GATAGT TGT ACAT AT T T GGGGGTACATGT GATATT T GGATACAT GTATACAA 

U 

;„/ T ATAT AAT GATC AAAT CAGG G T AAC T G G GAT AT C CAT CAC AT C AAAC AT T TAT T T T T TAT T C T T T 

IZl T TAGACAGAGTCTCAC TC TGTCACCCAGGC T GGAG TGCAGTGGT GCCAT CT CAGCT TAC TGCAAC 

Iff CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
llj GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

If I 

ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ri ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTGTATCCTTCCC 
Ill AACCTCTGAT CACCTCATTC TACT CTCTACCTCCATGAGATCCACT TT TTTAGCTCCCACATGTG 

AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
pi TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 
Hi GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGE S PMP FKRVFCQDGNVRS FCVCAVHFS SHQP PVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-7 9 
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FIGURE 131 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT ACCAGAGGAT GC TACAAC TCTCTACC TT CAGAAC AACC AAAT AAAT AATGCT GGGATTCC TTC AGAT 
TT GAAAAACT TGCT GAAAGT AG AAAGAAT AT ACC T ATACC AC AAC AGT T T AG AT GAAT TT CCT ACC AACC T 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAAT T CCCTATCTGGAAGAATT ACAT TT AG AT GAC AACTC TGT C TC TGC AGT T AGC AT AG AAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
p=. GC CC AGGACT AT AGAAGAACTACGCT TGGATG AT AATCGCAT ATCC ACT ATT TC AT CACC ATCT CT T C AAG 

f*l GT CT C AC T AGTCT AAAACGCCT GGT T CT AG AT GG AAACCT GTT G AACAAT CATGGT T T AG GTGACAAAGT T 

ff| TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
til TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
fjfl CT T AT CT AAG GC AGCT CT AT CGACTGGAT ATGTC C AAT AAT AACCT AAGT AATT T ACC TC AGGGTAT CTT T 

W GATG ATT T GG AC AAT ATAAC AC AACT GATTCT T CGC AACAATCCCT GGT AT T GC GGGT GC AAGATG AAAT G 

GGTACGT G ACTGGT T ACAATC ACT ACCT GTGAAGGTCAAC GTGCGTGGGCT C AT GT GC CAAGCC CC AGAAA 
AGGTT CGT GG GATGGC TATT AAGG ATCT CAAT GC AGAACT GTTT GATT GT AAGGAC AGTGGGAT TGT AAGC 
AC CAT TCAGATAACC ACTGC AAT ACCCAAC AC AGT GTATC CT GCCC AAGG AC AGTGGC CAGCT C CAGT GAC 
J«I CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
CAAT T AC AAT TACT GT GAAGTCTGTCACCT CT GAT ACCAT TC AT AT CT CTT G GAAACT TG CTCT ACCT AT G 
m ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
f|| AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CC AT GG AAAC C AGC AACCTCT ACCT ATT TG AT GAAACT CC TGT T TGTATT GAGACT GAAACT GC ACCCCT T 
CGAAT GT ACAACCCT ACAAC CACCCT CAAT CG AG AGCAAGAGAAAGAACC TT AC AAAAAC CCCAAT T T ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GC AGAAGCTGGCAC TAAGAAGG AC AACT CT AT CCT GGAAATC AG GGAAAC TT CT TT TC AGATGT TACC AAT 
AAGC AATGAACCCATCTCGAAGGAGGAGTTT GT AAT AC AC AC C ATATT T C CT CC T AAT GG AAT G AAT CTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
T C AC ACTC ATGA T GCT GAAGGACT C AC AGC AG AC TTGTGTTTTGGGTTTT TT AAACCT AAGGGAGGT GATG 
GT 
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FIGURE 132 



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAG I PSDLKNLLKVERI YLYHNSLDEFPTNLPKYVKELHLQENNI RT I T YDSLSKI PYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDPCVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAE L FDCKDSG I VS T I Q I TT AI PNT VYPAQGQWPAPVTKQPD 
I KNPKLTKDQQT TGS PSRKT I T I TVKS VTS DT I H I SWKLALPMTALRLS WLKLGHS PAFGS I TET 
I VT GERSE YLVTALE PDS PYKVCMVPME TSNL YLFDE TPVC I ETE TAPLRMYNPTT TLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 




Cf! Important features of the protein: 

Wl Signal peptide: 

amino acids 1-28 
: pi Transmembrane domain: 

m amino acids 531-552 

Q N-glycosylation sites. 

ttl amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 

Tyrosine kinase phosphorylation site. 

Ill 

amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 



APP_ID= 



10063685 



Page 286 of 322 



FIGURE 133 



rii 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTC ATG TCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C T CCAC TAC CTCAAGC T GAGTGATCCCAAGTACC TAAGAGAGT TCCAGCTGACCCT CCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACC AC TCCAACAGAGCCAGGAC TATAT CAACCTCT TC TGC GCCAACAT GAT GGAC T TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
TATCAG CAGCAT TT TTCGAGGAGAGTGAAGAGGC GAGAAAAACAAT T T CCAGAT TC TCGC TC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAA.CCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
T T T TATAT T T T T T TAGTAGAGACAGGGTT TCAT CATGT TGC TCAT GCT GGT C TCGAAC TCCT GAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 124 

MSARGRWEGGGRRACRGSLGIJ^QGAERVTSSEQRPi^SLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y I NL FCANMMDLNRRAEAI GYAYPTRD I FMENIMFCGMGGFS D FYKLRWLEAI L S WQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 13 5 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T T ACAGAT TTGATCCCGTT CGAGTGGATATCACT TCGAAAGGAAAAATGAGAGCAAGATATG 
TGAAT TACATCAAAACATCAGAGGT TGTCAGACTGCCC TATCCTCTCCAAATGAAATC'T TCAGGT 
|.« CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
g; TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 

2f C T GAC AT GAGACGGGAAAT GGAGCAG T CAAT GAATATGC TGAAT T CCAACCAT GAGT TGCC TGAT 

If j 

Sji GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 



m 

CO 



|p GAG GT CAAT AT TGATG T CAC T GAATTAAT T ACAGTGTC C TATAGAAAAT GCCAT TAAT AAAT TAT 

IJ ATGAACTACTATACATTATGTATATTAATTAAAACATCTTAATCCAGAAATCAAAAAAAAAAAAR. 
Rl AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVD GE E HVG FLKT DG S FWHD I P S G S YWEWS PAYRFD P VRVD ITS K GKMRAR YVN Y I KT S E 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLIF^ 
QSKNMLNSNHELPDVSEEHTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38/ 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 127 

GATGGCGCAGCCACAGCTTCTGT.GAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAAC T C GCAC TC AGACT CTGAACT CAGACC TGAAAT C T T C T C T TCAC GGGAGGC T TGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGC T G T GT GAT C GC C ACAAACC T T CAGGAAAT ACGAAAT GGAT T T T CT GAG AT AC GGGGC AG T G 

•PI 

5* TGCAAGCCAAAGATGGAAACAT T GACATCAGAATCT TAAGGAGGAC T GAGT CT T TGCAAGACACA 

Jj? AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
j | AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
p| CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
SI AAGAAAT ACAGCCAGATTCTGAGTCAC T T T GAAAAGCTGGAACCT CAGGCAGCAGT TGTGAAGGC 

TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
JU CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
yi ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
C| CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
Bl TGTAATATC T TTC TGC TATTGGATATAT T TAT TAGT TAAT ATATT TATT TAT T T T T TGCTAT T TA 

y AT GTATT TAT T T T T TTAC TTGGACATGAAAC T T TAAAAAAAT T CACAGAT T ATAT T TATAACC TG 

Til 

ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG 
CT AGGGGGG T TAT TCAT T TGTAT TCAACTAAGGACATAT T TAC TCAT GC TGATGC T C T GTGAGAT 
AT TTGAAAT TGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTT TGATGTGGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATC C TACACGGCCAGCAT GTAT T TC TACAAAT AAAGT TT T CTT TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 128 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSE IRGS VQAKDGN I D I R I LRRTE S LQDTKPANRCCLLRHLLRLYLDRVFKN YQT PDH YTLRKI S 

SLANSFLTIKKDLRLSHAHMTCHC^ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAGCAAAAGCAGC T GGAAAAAGAATAGT CT TT TATGGAGAT GAAACC TGGGT TAAAT TA 
T T CCCAAAGCAT T T TGTGGAATAT GAT GGAACAACCTCAT T T T TC GT GT CAGAT TACACAGAGG T 
& GGAT AATAAT GTCACGAGGCAT T T GGATAAAGTAT TAAAAAGAGGAGAT T GGGACATAT TAATCC 

z*. T C CAC T AC C T GGG GC T GGAC C ACAT T G G C CAC AT T T CAG G GC C C AAC AG C CC C C T GAT T GGGCAG 

i AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
=1 GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
[f GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
J CCCGGTGATATCCGAGATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
ft ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
| a S CAATGAGAGAGCAGTTGAGATTTTTACATTTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG 
|f AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
1 GAACT GGATCAGAC TGTAC T TGGAGGAAAAGCAT T CAGAAGTCC T ATT CAAC CTGGGC TCCAAGG 

TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
?r ; TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Z CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 

I CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 

II GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
jj AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
•| TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
% CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
si TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAG TATGTAGT TACCAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE mo 



r. -.; i 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGWFMPYTTYL^ 

VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSE TGS HGAS S TEEVNTPL I L I S S AFERKPGD I RHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 



in amino acids 58-76 



N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites . 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCAC GAGGCAAGCC T T CCAGGT TAT CGT GACGCACC T TGAAAG T C T GAGAGC TAC TGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TC CCTATAGAAAACAAC TGCCAG CACCTTAAGACCAC TCACACCT TCAGAGT GAAGAAC T TAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAG T T CC AGAT AAAAAC T AC ATACGCC C AGAGATC T TC T T T GC AT T AGCC T C ATCC T TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GTGACAAGGATAAAGGACAAAGT CAT CCAT CCC TTCAGCTGAAGAAGGAGAAACTGAT GAAGCT 

Jpfc GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 

CI ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACAT TGAATT TTCAT TTCAACCAGTT TGCAA 

J* I AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

P AT T TGAAC TAAT TGTAT AAAAACACCAAACCTGCTCAC T 

CI I 
111 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS SLS S AS AEKGS P I LLGVSKGE FCL YCDKDKGQSHP S LQLKKEKLMKLAAQKES ARRP FI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-rayristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

CT AGAGAGTATAGGGCAGAAGGATGGCAGAT GAGT GAC T CCACAT CCAGAGC TGC C T CCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGGGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACAT CT TGAAT TCCT GGAAGGAGCAGGTGGAGTC CAAGAC TGTAT T C T CAAT GGAGC TAC TGCT 
GGGGAGAAC T AGGTGT GGGAAAT T T GAAGACGACAT TGACAACTGCCAT T TCCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCTTGAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCATTTTAAGAAAAAAAAAAAl 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHE 

YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQSSTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of" the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE las 

CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCAT CAAGCACAGAAT CAAGTGGAAC CGGAAGGCCCT GCCCAGCACTGCCCAGAT CA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCGAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAAC TCT TAGAGGTAG 
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CT C TAAT GAAAT T G TGAAAGC TCCAT G T T TAGAAATAAATGAAAACACC T GA 
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FIGURE 146 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITSAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEraLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 11-11, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSSSCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



I r . s N-myristoylation site. 

D amino acids 62-68 

i 
m 

til 

m 
m 

If! 



a 
m 
o 
m 

Q 

ru 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGT GCAGAAAGACCT T TGAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
N s TAAGAAGCT CC CCACT T GATAATAAGT T GAAT GT GGAAG ATGTT GAT T CAACCAAGAAT CGAAAACT G 

5!? ATCGATGAT TAT GACT CTACTAAGAGT GGATTGGATCATAAAT T T CAAGAT GATCCAGAT GGTCTTCA 

5? TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
l\l AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
m CAAGCACATACACTGGAAGATGAAGTAGCAGAGGT T TTACAAAAATTAATCT CAAAGGAAGCCAACAA 

g! TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
III T GACT CCAATGGCAGCAATTCAAGATGGT CTTGCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 

* TTAAC CT T GACAAAT GGCTTGGAAAGGAGAACTAAAACCTACAGT GAAGACAACTTT GAGGAACTCCA 

f 1 

Tl ATATT TCCCAAATTT CTATGCGCTACT GAAAAGTAT TGAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA 

'ij CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT 
ffj CCAGAAGAAGGT GTT TCCTACCTT GAAAACT TGGATGAAAT GATT GCTCTT CAGACCAAAAACAAGCT 

[•! AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
f|| CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAAT AT TGAAT GGTT GAAGAAACATGACAAAAAGGGAAATAAAGAAGATTAT GACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTT TCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGAT TAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE iso 

MG FLGTGTW I LVL VLP I QAFPKPGGSQDKS LHNRELSAERPLNE Q I AE AEEDKI KKT YP PENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS I RS S PLDNKLNVEDVDS TKNRKL I DDYDS TK5 GLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLI SKEANNYEEDPNKPTSWTENQAGKI PEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTLI DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites : 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGC T TCTAGC TGGAGGGCTGCATGCAGGGAAGGTCAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
C C AG T G CC T G TC AT GTGGGGTGGGG CAG GAG C C GAC T C T AAC AC T AG AGCCAG T GAAC AT CAT GG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
Ls CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
f **_ CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
O TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
Q GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
If] GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
Tl TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
JjJI- GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA 
0" GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
Ul AGACC TACT T ACAAAGTGGCATAT AT TGCAAT T TAT T T TAAT TAAAAGATACCTAT T TAT AT AT T 

!j! TCTT TATAGAAAAAAGTC TGGAAGAG T T TAC T TCAAT T GTAGCAAT GTCAGGGTGGTGGCAGTAT 

J AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
JL AT TCCT TGTATAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA 

W TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
(11 TTGTAGTAGTGATCAGGAAACA^ATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
Q GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
»j GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
Jlf ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GT CT CT GCAGAT GTAGT TAGT TAAGACAAGGT CATGC T GGAT GAAGGTAGACC TAAAT TCAAT AT 
111 GACT GGTT TC C T T GTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TAT GTA 

AAGATGAAGGCAGAGATCGGAGTT TTGCAGCCACAAGC TAAGAAACACCAAGGATTGTGGCAACC 
AT CAGAAGC T TGGAAGAGGCAAAGAAGAAT TC TT C CC TAGAGGC T T T AGAGGGATAACGGC T C T G 
C T GAAACC T TAAT CTCAGAC T TCCAGCC TCCTGAACGAAGAAAGAATAAAT T T CGGC TG TT T TAA 
GCCACCAAGGATAAT TGGTTACAG CAGC TC TAGGAAAC TAATAC AGCT GCTAAAATGAT CCCT GT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATAT GGCAGAAGT GAT GGCATGCC AC T TCCAAGATTAGGT TATAAAAGACAC T GCAGC T TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTC TGTGAGATAAT CGATGT TTGTTGT TTT 
AAGTT GCT CAGT T TT GGTC TAACT TGTT AT GCAGCAATAGATAAAT AATAT GCAGAGAAAGAG 
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FIGURE 152 

MVLS GALC FRMKDS ALKVL YLHNNQLLAGGLHAGKVT KGEEI SWPNRWLDAS LSPVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites* 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
T TCCAGCAGCCC TATAT CACCAACCGCACCT TCATGCT GGC TAAG GAGGCT AGC T TGGC TGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
T AT TGAAGGT GAT GACCTGCAT AT CCAGAGGAAT GTGCAAAAGCT GAAGGACACAGT GAAAAAGC 
T TGGAGAGAGTGGAGAGATCAAAGCAAT TGGAGAAC TGGAT T TGC T GT T TAT GTC T C TGAGAAAT 
GCCTGCATT TGA CCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
C AAT T AGAT G CC C CAAAGCGAT x T T T T T TAAC CAAAAGGAAGAT G GG AAG CCAAAC T C CATC AT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC 
TATACACAGAAAACAAT TTATTTT TTAAATAATTGTCTTT TTCCATAAAAAAGATTACTT TCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTAT TT AT TATTATTATAAGACTGCAT TT TATT TATATCATTT TATTAATATGGATT TATTTAT 
AGAAACATCAT TCGATAT TGC TACT T GAGT GTAAGGCTAATAT T GATATT TATGACAATAAT TAT 
AGAGC TATAACATGT T TAT T T GACC T CAATAAACAC T TGGATAT CCC 
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FIGURE ika 

MAALQKSVSSFLMGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRL^ 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

O N-glycosylation sites. 

Jf amino acids 54-58, 68-72, 97-101 

r|| N-myristoylation sites. 

|y amino acids 14-20, 82-88 

L, Prokaryotic membrane lipoprotein lipid attachment site. 

i'm amino acids 10-21 

Gil 
?*% 

as s 

m 
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FIGURE iss 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GC T GCT CT ACCACAACCAGACTGT CT T C TACAGGCGGCCAT GCCAT GGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T GACAT GGAT GAT T C T GAGGAGGAAGC T GT TAT TGAAT GT AT AGAGAT T TAT CC AAAT AAAT AT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGAC CT CCGAGT AGAACCTGT TACAAC TAGTG T TGCAACAGGGGAC TAT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAGTCTAT T T CAT T GGGGCCCATAATAT T CC TAAT GCAAAT AT GAATGAAGAT GG 
y 5 CCCT T CCAT GTC TG TGAAT T TCACC TCACCAGGC TGCC TAG AC C AC AT AAT GAAATATAAAAAAA 

p AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
O GT AGAAG TGAAC T TCACAACCAC TC CC CT GGGAAACAGATACAT GGCT CT TATCCAACACAGCAC 

m 

; ? ; TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TT CCAG T GAC T GGGGAT AG T GAAG G T GC T AC G G T GCAGC T GAC T C CAT AT T T T CC T AC T T G T GGC 
HI AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
111 GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
* CATGGGTGC T GG TGGCAGGGATC TAT C TAAT G TGGAGGCACGAAAGGATCAAGAAGAC T T CC T T T 

*J* TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
%l TCACACAAT T TGT TAC T TCACTGAAT T T CT T CAAAACCAT T GCAGAAG TGAGGTCAT CCT TGAAA 

ft], AGT GGCAGAAAAAGAAAATAGCAGAGAT GGG T CCAGT GCAGTGGCT TGCCACT CAAAAGAAGGCA 

0 GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
i'W GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GT GAT CT AAGAAGCCAGAT TCAT CTGCACAAATACGTGGTGGT CT ACT T T AGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSIJyaCRSAVPREPTVQCGSETGPSPEW14LQHDLIPGDLRDLRVEPVTTSVATGDYSIIMNVSWV 
LRADASI RLLKATKICVTGKSN FQSY SCVRCNYT EAFQTQTRPSGGKWT FSY IGFPVELNT VY FIGAHNI P 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVECAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIECVLVVYPSEICFHHTICYFTEFL 
QNHC RS E V I L E KWQ KKKI AEMG P VQWLAT Q KKAADKW FLL S ND VN S VCDGT CG KSEGSPSENSQDLF PL A 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAAG AGGTGCATATCC 
ACT CAGCT GAAGAAG 



III 

m 



yi 



CO 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNI ESRS TS PWNYTVT WDPNRYPSE WQAQCRNLGC I NAQGKED I SMNSVP I QQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 



N-glycosylation site. 

amino acids 83-87 

.CP N-myristoylation sites. 

til 

CP 

m 



amino acids 106-111, 136-141 



rii 
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FIGURE 161 



ACAC TGGCCAAACAAAAACGAAAGCACT CC GTGC TGGAAGTAGGAGGAGAG T CAGGAC T CCCAGG 
ACAGAGAGTGCACAAAC TACCCAGCACAGCC CCC T CC GCCCCCT CT GGAGGC TGAAGAGGGAT TC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGaAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAG CC T GAAGAT GAGGAAAAG T T T G G AGGAG C AGC TGAC T C AG GG G T G GAGGAGC C T AGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAAC T CAACCACACACAGCAGCT GCCT GCCCT GCCC T GGC T CAACGT GTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTT.ee 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDlSr^HLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHEOIWALWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACAT CC T GACGTGGGACAGC GGGCCAGAGGGCACC CCAGACACGGTC TACAGCATCGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T T CTACCAC T TAGAGC TCCAGGTCAACCGCACCT ACCAAATGCAC CTT GGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCAT AGCC TGT CCGAGATCACC TAC T TAGGGCAGC CAGACAT CTCCAT CCT CCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAAT GACAGTGCAAGGAGGAAAT GCAGGGAAAC T C C CGAGG T CCAGAGCCCCACCTC 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

T T CCCAGCCAGGGCAAC TGCC TGACGT T GCACGAT T TCAGC T T CATTCC T CT GATAGAACAAAGC 

GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT 

AT CCT GAGAATGGGGT T TGAAAGGAAGG TGAGGGC TGT GGC CC C T GGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

AT C T G T AAT GT GG GGAT C AT AACACC T ACC T CAT GGAGT TG T G G T GAAGAT GAAAT GAAG T CAT G 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHWFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
E FLGT IMI CVPTWAKESAPYMCRVKTLPDRTWT YS FSGAFL FSMGFLVAVLCYLS YRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLS'SVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence . 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGG.CGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGT TGAACACAGTTT TGGATATTTTCCAAAAGATT TGATCAAGGTACTT CATAAATACACGGA 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGACT TTGTCTGCTT TGAAGGAGGAAGAGATGATT 
T T AATAG T TATAAT G TAGAAGAGC T T T TAG GAT C T T T GGAAC T G G AGGAC T C T GT ACC T GAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CAT TACAGCAAAGGAT T TCGT TGGCATCAAAATC TAAGT TT GTT T T ACAAAGATTGTT T T TAG TA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVXGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT TGAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGT TGCAAAACCTGACAAAACCCGGG 
ACAGGGAT T TC TGAT GGT GATT TC TGGAT AGGGC T T TGGAGGAAT GGAGAT GGGCAAACATC T GG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
tl* ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
y. CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
Cl CAAG TAT GAACCAGAGAT TAATCCAACAGCC CC TGTAGAAAAGC CT TATCT T ACAAATCAACCAG 

3| GAGAC AC C CAT C AGAAT GTGGTTGT T AC T GAAGC AGG T AT AAT T C C C AAT C T AAT T TAT G T T G T T 

f,j ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
JJJ GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGT C TACACTGTGGATT TCAAAGAGTA 

JJ* CC AGAAAAGAAAG T GG C AT GGAAG T ATAAT AAC T CAT T GAC T T G G T T C C AGAAT T T T GT AAT T C T 

III GGAT CTGTATAAGGAAT GGCATCAGAACAATAGC T TGGAATGGC T T GAAA.T CACAAAGGATCTGC 

ifl AAGAT GAAC T GTAAGCTCCCCCT T GAGGCAAATAT TAAAGTAAT T T T T ATAT GTC TAT TATT T CA 

TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
^ ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
II] TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
Q AT GAAC T GT T C TAATAT T TAT T T T TAT GGCATC TCAT T T T T C AAT ACAT GCT C T T T TGAT T AAAG 

f|| AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
JbJ TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
™* AATAAGAAGCTAT T TCAT TAAGT GT GATATAAACCTCCTCAAACAT TT TACT TAGAGGCAAGGAT 

III T GT CT AAT T T CAAT T GTGCAAGACATGT GCC TT ATAAT TAT T T T T AGC T TAAAAT TAAACAGAT T 

TT GTAAT AATGTAACT T TG T TAATAGGT GCATAAACACTAATGCAGTC AATT T GAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TAT AT TAGTATACAAAGAGGT CAT GT GGTTGAGACCAGGT GAAT AGT CACT AT CAGT GTGGAG 
ACAAGCACAGCACACAGACAT TT TAGGAAGGAAAGGAAC TAC GAAATCG TGT GAAAATGGG T TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
S TLW I SKSTRKES GMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-2 69 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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